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MULTIDIMENSIONAL STIMULUS DIFFERENCES AND 
ACCURACY OF DISCRIMINATION ! 


CHARLES W. ERIKSEN AND HAROLD W. HAKE 


The Johns Hopkins University 


Can stimuli be more accurately 
discriminated from one another if they 
differ on two or three dimensions than 
if they differ along a single dimension? 
And, can the discrimination accuracy 
for such multidimensional stimuli be 
predicted from the discriminability of 
the component dimensions? Little 
experimental work has been performed 
on these specific questions, but there 
are several reports in the literature 
that indicate multidimensional cues 
facilitate learning and reduce reaction 
time. Eninger (4) found that rats 
‘learned a discrimination problem more 
rapidly when visual and auditory cues 
were compounded than when either 
auditory or visual cues were used 
alone. Similar findings were obtained 
by Miller using compounds of visual 
and auditory modalities in a classical 
conditioning study (10). In both 
these studies compounding was across 
different sense modalities. However, 
Weitz (12) has shown that reaction 
time is reduced when two dimensions 
within the same sense modality are 


1 This research was performed under Contract 
No. AF 33(038)—22642 between the Institute of 
Cooperative Research, The Johns Hopkins Uni- 
versity, and the Psychology Branch of the Aero 
Medical Laboratory, Directorate of Research, 
Wright Air Development Center. 


compounded. He found that re- 
action time to control levers was more 
rapid if the knobs differed in both hue 
and form than if they differed only in 
hue or only in form. 

Several other studies have indi- 
cated, however, that the over-all 
perceptual situation determines in 
some cases whether or not com- 
pounded or multidimensional cues 
contribute to discrimination. Eriksen 
(5) found that compounding cues on 
the dimensions of hue, form, size, and 
brightness did not increase the speed 
with which objects could be located in 
a visual display. He found that this 
failure to improve location time was 
attributable to a perceptual hetero- 
geneity factor that resulted from 
introducing more dimensions of vari- 
ation into the display. More recently 
Bricker (1) has shown that redundant 
position cues were not useful in learn- 
ing to associate labels to a set of visual 
forms. 

In general, then, the results of work 
in this area have been to suggest that 
the effects of compounding dimen- 
sional cues depend in some measure 
upon the response studied and the 
nature of the task. In none of the 
cases has there been a direct effort to 
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assess the effects of compounding di- 
mensions in terms of accuracy of 
discrimination or the number of dis- 
criminable steps in a stimulus series 
that can be obtained. In the present 
study both discrimination accuracy 
and the number of reliably discrimi- 
nable steps have been determined for 
a series of stimuli varying along the 
dimensions of size, hue, and bright- 
ness. The discrimination measures 
obtained for stimuli varying along 
these single dimensions were then 
compared with the discrimination 
measures obtained when a stimulus 
series varied on size and hue, size and 
brightness, hue and brightness, and 
size, hue, and brightness. 


METHOD 


Design.—Discrimination was determined by 
the method of absolute judgment, or as it is 
sometimes called, the method of stimulus rating. 
The method requires Ss to identify by a name, 
number, or value each member of a set of indi- 
vidually presented stimuli. In the present 
study, Ss used the numbers from 1-20 in order to 
identify which stimulus had occurred. The 
absolute method was chosen as a measure of 
discrimination since it is amenable to an analysis 
in terms of information theory. Garner and 
Hake (7) have shown that the information trans- 
mitted by Ss about a series of stimuli along a 
dimension is interpretable as the number of 
stimuli that can be selected from this dimension 
to yield perfect discrimination under absolute 
conditions. 

Seven different series of stimuli were judged 
in the present study. In three of these series the 
stimuli composing the series differed from one 
another along a single dimension. These di- 
mensions were, respectively, size, hue, and 
brightness. In the remaining four stimulus 
series multidimensional stimulus differences were 
employed. In three of these four remaining 
series each stimulus in the series could be 
uniquely specified on each of two dimensions, 
and in the fourth series each stimulus could be 
specified on all three dimensions. The bright- 
ness and the size-brightness series each contained 
17 stimuli, and 20 stimuli were used in each of the 
remaining series. 

Each of six Ss made 100 judgments of each 
stimulus in each of the seven stimulus series. 
The Ss completed all judgments of a series before 
beginning their judgments on the next series. 
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TABLE 1 
SpecIFICATION OF STtimuLUS CATEGORIES 
Usep ror Srze, Hue, Bricutness, 
anp Hve-BriGHTNESS 


























Stimulus Dimensions 
Response 

Gavea)| Hee | "ESS*! prighemess 

1 1/8 |SR S/ON1/|5R 4/4 
2 2/8 |10R 5/6IN1.5/1I0R 5/4 
3 3/8 | 5 YR5/6\N 2/ | 5 YR6/4 
4 4/8 |10 YR 5/6\N 2.5/|10 YR 7/4 
5 5/8 |5Y 5/6N3/ | SY 8/4 
6 6/8 |10Y 5/6\N 3.5/\10Y 7/4 
7 7/8 | 5GY5/6IN 4/ | 5 GY 6/4 
8 8/8 |10 GY 5/6|N 4.5/|10 GY 5/4 
9 9/8 |5G S/6iIN5/ |5G 4/4 
10 10/8 }10G 5/6IN5.5/)10G 3/4 
11 11/8 | 5 BG 5/6\N 6/ | 5 BG 2/4 
12 12/8 |10 BG 5/6\N 6.5/|10 BG 3/4 
13 13/8 |5B 5S/6IN7/ |5B 4/4 
14 14/8 |10B 5/6IN7.5/10B 5/4 
15 15/8 | 5 PB 5/6\N 8/ | 5 PB 6/4 
16 16/8 |10 PB 5/6\N 8.5/|10 PB 7/4 
17 17/8 |5P 5S/6IN9/ |5P 8/4 
18 18/8 |10P 5/6 .... |10P 7/4 
19 19/8 | 5RP 5/6 5 RP 6/4 
20 20/8 |10 RP 5/6 10 RP 5/4 





Note.—The specifications for hue and brightness are 
given in the Munsell notation. For example, 10 R 5/6 
indicates that the hue is red with a brightness value of 
5 and a saturation of 6 on the Munsell scales. 


To control for practice effects, Ss made their 
judgments of the seven stimulus series in a 
counterbalanced order.” 

Materials and apparatus.—Table 1 defines the 
stimuli used in the size, hue, brightness, and the 
hue-brightness series and shows the response 
number that was assigned to each. The other 
three multidimensional series are not represented 
in this table since they were constructed by com- 
pounding stimuli of the single dimensional series. 
In constructing these multidimensional series the 
component single dimension stimuli were super- 
imposed one on top of the other. Thus for the 
series size-hue, Stimulus 1 was a 1/8 in. 5 R 5/6 
square, Stimulus 2 a 2/8 in. 10 R 5/6 square, 
Stimulus 3 a 3/8 in. 5 YR 5/6 square, etc. In 


2 The design followed a 7 X 7 latin square 
which was not quite realized owing to the seventh 
S’s failure to complete the experiment. How- 
ever, statistical examination of the data revealed 
no evidence of systematic practice’ effects from 
one series to the next. 

3 The multidimensional series of hue-bright- 
ness was an exception to this method of con- 
struction and as a result has been presented in 
Table 1. This exception was necessitated by the 
nonindependence of hue and brightness and the 
inability to obtain Munsell hue in half brightness 
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constructing the multidimensional series of size- 
hue-brightness, the hue-brightness stimuli were 
compounded with the size stimuli in the manner 
described above for size-hue. 

As can be seen in Table 1 the response num- 
bers were assigned to stimuli in such a manner 
that the serial order of the response numbers 
conformed to the position of the stimuli along 
the dimension(s). 

The stimuli for the size series were cut from a 
dark gray paper and for the hue, brightness, and 
hue-brightness series 7/8-in. squares of Munsell 
papers were used. For all series the stimuli were 
mounted in the center of 3-in. white cardboard 
squares to facilitate handling. 

Procedure—The Ss were run individually 
with an average of five 1-hr. sessions being re- 
quired to judge the stimuli in any one series. 
Prior to beginning the experimental judgments 
on a given series, each S made eight practice 
judgments with correction of each stimulus in 
the series. Following the practice judgments 
100 judgments of each stimulus were obtained 
during the experimental sessions. In all, each 
S made a total of 13,400 experimental judgments 
over the seven stimulus series during the experi- 
mental sessions. 

As was mentioned above, all stimuli were 
mounted on white cardboard squares. Four of 
these stimulus cards were constructed for each 
stimulus. The stimulus cards were presented to 
S one at a time and his judgment recorded. The 
experimental sessions were broken up into blocks 
of judgments where a block was defined as four 
judgments on each stimulus in the particular 
series. Between blocks S was given a short rest 
while £ thoroughly reshuffled the stimulus cards 
in order to insure a random order of appearance 
of the different stimuli. 

Subjects —The six Ss were male under- 
graduate students at The Johns Hopkins Uni- 
versity. None had previously served as S in a 
psychophysical experiment. All were screened 
by a shortened version (3) of the Ishihara Color 
Vision test to insure that they were color normal. 


RESULTS 


Two different methods have been 
used to express the discrimination 


steps. It will also be noted in the table that the 
hues in this series have a saturation of 4 in the 
Munsell system while in the hue and the size-hue 
series all hues have a saturation of 6. Since the 
hues are of a different saturation and the bright- 
ness differences between adjacent stimuli are a 
full brightness step, the results obtained with 
this multidimensional series are not directly 
comparable to those obtained with the other 
multidimensional! series. 


accuracy obtained with the three 
single and the four multidimensional 
stimulus series. The first of these 
was to determine the information 
transmitted in bits (/,) between the 
stimuli and responses in each series. 
Garner and Hake (5) have shown that 
the antilogarithm to the base 2 of /, 
is interpretable as the maximum num- 
ber of stimulus values that could be 
selected from a series of stimuli if 
errorless discrimination were desired. 
The second method consisted of com- 
puting the average percentage of 
correct responses given to the stimuli 
within each series. 

As a first step in evaluating the 
results an over-all test of the sig- 
nificance of differences in discrimi- 
nation accuracy among the different 
series was made. The value of J; for 
each S was computed for each of the 
seven series. These /,; values were 
then subjected to a two-way analysis 
of variance (stimulus series and Ss). 
The resulting F of 20.48 (P > .0001) 
demonstrated that the variation in 
accuracy between series was much 


TABLE 2 


DiscRIMINATION ACCURACY FOR THE 
Turee SINGLE AND THE Four 
MULTIDIMENSIONAL STIMULUS 

SERIES 








Average Per Cent 
No. Stimuli/Correct Judgments 


Stimulus Series | J: Absolutely 





— 
namie | Observed | Ere. 
Size 2.84 7.19 47.5 
Hue 3.08 8.45 53.4 


Brightness 2.34 5.06 41.3 
Size-Hue 3.55} 11.90 76.2 | 72.6 
Brightness | 2.98 7.89 59.8 | 59.5 
Brightness |3.76} 13.55 84.2 — 


Size-Hue- 
Brightness | 4.11] 17.28 96.5 90.7 

















Note.—Based upon error term for the analysis¥of 
variance, differences of .38 and .52 in J; are significant 
at the .05 and the .01 level, respectively. 
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greater than could be reasonably at- 
tributed to chance. 

The discrimination accuracy ob- 
tained with each of the seven series 
isshownin Table 2. The first column 
of this table gives the average value 
of J, for the six Ss. In the second 
column the antilog of J; to the base 2 
is shown. Thus the values in this 
column represent the best estimate of 
the maximum number of stimulus 
categories or values that could be 
selected from the different series if 
errorless discrimination were to be 
obtained. 

As can be seen from Table 2, there 
is a marked gain in discrimination 
when two or three dimensions are 
compounded. This is true irrespec- 
tive of which measure of discrimi- 
nation is considered. The use of the 
multidimensional series of size-hue 
results in significantly more accurate 
discrimination than is obtained by 
either the size or the hue dimension 
used alone. The size-brightness and 
the hue-brightness series also yield 
better discrimination than is obtained 
by any of the component dimensions 
used singly; and, in the case of the 
series size-hue-brightness where stim- 
uli differed from adjacent stimuli in 
the series on each of these dimensions, 
discrimination was nearly perfect for 
all 20 stimuli. 


This improvement in discrimination 
obtained by compounding dimensions 
indicates at least a moderate degree of 
independence in the accuracy with which 
two or more dimensions are simultane- 
ously judged in the same stimulus. That 
is, the improvement could be expected 
only in the event that S could simulta- 
neously judge the stimuli on more than 
one of the component dimensions and 
that his judgments about values along 
these dimensions were not completely 
determined by his judgments of values 
along one of them. In the extreme case, 
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TABLE 3 


DisTRIBUTION OF RESPONSES IN PER CENT To 
THE 8/8-In. SQUARE.IN THE S1zE SERIEs, 
THE 10 GY rn THE Hue Series, AND To 
THE COMPOUND OF THESE Two 











VALUES IN THE S1zE-HvE 
SERIES 
(N = 600) 

Response | Size | Hue | Spe-Hus | Siee Hue 
5 0.3 0.0 0.0 0.0 
6 0.7 0.0 0.0 0.0 
7 16.8 0.8 1.7 2.1 
8 46.2 49.6 81.5 72.8 
9 24.0 19.8 12.3 18.3 
10 8.8 17.5 4.3 5.6 
ll 3.0 9.5 0.2 1.2 
12 0.2 aa ¢ 0.0 0.0 

















for example, where S’s perception of 
values along all other dimensions in a 
series of compound stimuli was com- 
pletely determined by his perception of 
values along just one of the dimensions, 
then his ability to discriminate among 
the compound stimuli could be no better 
than his ability to discriminate among 
stimuli differing only in the single di- 
mension. Since accuracy of judgment 
for compound stimuli was considerably 
better in this experiment than that for 
stimuli differing on a single dimension, a 
considerable degree of independence of 
judgment along the component dimen- 
sions in the compound stimuli must be 
assumed. 

The results obtained with the multi- 
dimensional series can be _ predicted 
rather well by the use of three assump- 
tions concerning discrimination behavior 
under the present conditions. They are: 


1. At a given moment in time, the 
response tendency (rating) evoked by a 
magnitude on one dimension is inde- 
pendent of the response tendency evoked 
by magnitudes on other dimensions. 

2. The distribution of response fre- 
quencies to a stimulus magnitude is a 
measure of the relative strengths of re- 
sponse tendencies elicited by that magni- 
tude. 

3. In the case of competing response 








TO 
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tendencies, the stronger tendency will be 
evoked as a response. 


In Table 3 these assumptions have 
been used to predict the distribution of 
response to Stimulus 8 (8/8-in. 10 GY 
square) in the multidimensional! size-hue 
series. The first column of this table 
shows the response distribution obtained 
for this size (8/8-in. square) when it 


occurred as Stimulus 8 in the size series * 


and the second column shows the distri- 
bution of responses to this hue (10 GY) 
when it occurred as Stimulus 8 in the hue 
series. The third column is the response 
distribution to the stimulus compound of 
these two stimulus magnitudes or values. 

If we consider the first two columns as 
reflecting the strengths of response tend- 
encies to the size and hue magnitudes or 
values when viewed separately and as- 
sume also that the response tendencies 
evoked by each dimension are completely 
independent of each other, then we can 
expect that on 16.8% of the appearances 
of compound Stimulus 8 response tend- 
ency “7’’ would be evoked by the size of 
the stimulus and on 49.6% of its appear- 
ances response tendency “‘8”’ evoked by 
its hue. Since it is assumed that these 
two events are independent, we may 
expect that on about one-half (49.6) of 
the occasions on which S perceived the 
size of Stimulus 8 as being “‘7” a stronger 
tendency to respond “‘8”’ will be evoked 
by the hue of the stimulus. On all oc- 
casions in which these two events occur 
together the stronger tendency will 
appear as an overt response, and, by this 
means, about one-half of S’s underesti- 
mations of the size of the compound 
stimulus by one scale value will be cor- 
rected by his more accurate perception 
of the hue. 

In a similar way we may consider all 
the other possible events that could arise 
from these two distributions. This es- 
sentially reduces to multiplying each 
response percentage for the size di- 
mension by each of the response per- 
centages for the hue dimension. The 
product in each case is assigned to the 
response category that has the highest 
percentage value. For example, .3%for 


Size Response 5 would be multiplied by 
.8% for Response 7 for hue and the 
product .24% would be assigned to 
Response 7 since it had the stronger 
response tendency. 

Column 4 in Table 3 is the predicted 
response distribution for Stimulus 8 in 
the size-hue series. It was obtained by 
following the procedure which has been 
described through the 48 multiplications 
involved and then summing the products 
assigned to each response category. The 
predicted distribution is a fair approxi- 
mation to the empirical one. 

The distribution predicted in Table 3 
is for only one of the compounded 
stimuli, however, and is not an adequate 
test of the predictive model. Accord- 
ingly, the procedure was carried out for 
each stimulus in the size-hue, size- 
brightness, and the size-hue-brightness 
series.‘ The predicted percentage of 
correct judgments for each stimulus was 
then averaged for each series. These 
average predicted percentages of correct 
responses are shown in Column 4 of 
Table 2 where they can readily be com- 
pared with the obtained values. 

Comparison shows the predicted per- 
centage of correct responses to be very 
close to the empirical results. The 
largest discrepancy is 5.8% for the multi- 
dimensional series size-hue-brightness, 
but it is less than 1% for size-brightness. 
The closeness of the fit can be further 
demonstrated by computing the rank- 
order correlation between the predicted 
percentage of correct judgments for each 
stimulus in a series and the obtained 
percentage of correct judgments. For 
the size-hue series this rank-order cor- 
relation is .86 for the 20 stimuli, and for 


4 It was not possible to obtain such predictions 
for the hue-brightness series since, as was pointed 
out in the section on procedure, this multidi- 
mensional series was not a simple compound of 
the stimuli used in the hue and brightness series. 
Owing to limitations of Munsell hues it was 
necessary to use a full brightness step between 
adjacent hues in this multidimensional series, 
whereas in the brightness series alone, half 
brightness steps were employed. Consequently, 
it was necessary for the same brightness values 
to be compounded with several different hues. 
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the 17 stimuli in the size-brightness 
series, rho is .77. A similar rank-order 
correlation for the size-hue-brightness 
was not practicable since the accuracy of 
discrimination for this series was nearly 
perfect and there was little or no vari- 
ation among the predicted or obtained 
judgments of the stimuli. 


Discussion 


The results of the present study show 
quite clearly that compounding dimen- 
sional differences among stimuli results 
in a large and significant improvement 
in discrimination using the absolute 
method of judgment. In effect, the 
results show also that the small values 
for J,, the maximum number of stimuli 
which can be discriminated with essen- 
tially no error, which have been obtained 
by using the absolute method (6, 8), are 
not necessarily the result of limitations 
imposed by the method itself but rather 
by the paucity of perceptual cues pro- 
vided by stimuli varying on a single 
dimension. The very large numbers of 
different objects and events which are 
separately recognizable and identifiable 
in the world, then, may be at least par- 
tially explicable in terms of the multi- 
plicity of cues available for discrimi- 
nation. 

Generalization from our results should 
be attempted with some caution, how- 
ever, because not all perceptual tasks, 
even of the absolute judgment type, are 
identical with the tasks performed by 
our Ss. The importance of the particular 
perceptual task and the performance 
measure studied has been pointed out 
earlier in this paper to be a factor in 
determining the effect of compounding 
dimensional differences between stimuli 
(1, 5). 

It is important to recognize also that 
the improvement in judgmental per- 
formance found, when stimuli were made 
more complex by compounding two or 
more dimensions in the same stimulus 
series, resulted primarily from the ability 
of Ss to make fairly independent judg- 
ments of stimulus values along each of 
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We cannot 
assume that this is always the case for 


the component dimensions. 


compound stimuli. For some stimuli, 
judgments made of values in the separate 
dimensions may be interrelated. That 
is, the evocation of a particular response 
tendency by a component of a compound 
stimulus may change the likelihood of 
evocation of other response tendencies by 
other components of the stimulus. We 
could expect this to occur when Ss have 
learned by long experience to associate 
the occurrence of certain values in one 
dimension with the occurrence of par- 
ticular values in another. For example, 
we have assumed on the basis of the data 
in Column 1 of Table 3. that Stimulus 8 
in the size-hue series would evoke re- 
sponse tendency “7” 16.8% of the time. 
If our Ss had had any previous firm 
knowledge that lower-numbered sizes 
tended to occur with lower-numbered 
hues, we should not expect them to be as 
likely to assign higher numbers to the hue 
of the compound stimulus, given that 
they perceived the size to be “7,” as they 
would if judging the hue in isolation. 
And, in that case we could not expect Ss 
to correct their underestimations of size 
as often by making correct estimates of 
the hue values. 

It is probable that most of the stimuli 
which we perceive outside the psycho- 
physical laboratory involve such inter- 
dependencies among the several stimulus 
dimensions of which they are composed. 
A relationship between the area of a 
stimulus and its apparent intensity has 
been reported by Hanes (9). Other 
examples include the relationship be- 
tween pitch and loudness (11) and the 
relationship between the proximity of 
adjacent parallel lines in the visual world 
and the likelihood that the lines are a 
part of a cohesive object in the world 
(2). We have included among our ex- 
amples interdependencies which may be 
due to the nature of our receptor mecha- 
nisms and to the nature of our world. 
Both factors undoubtedly are concerned 
here. 

Another very real consideration quali- 
fying our results is the possible effect of 
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compounding a very large number of 
stimuli in a single stimulus series. This 
may result in a loss of ready awareness 
by Ss of some of the dimensions involved. 
If this were true, we should expect there 
to be some upper limit to the amount of 
improvement in discrimination which can 
be obtained by compounding dimensions. 

There is one other aspect of the present 


data that deserves comment. This con-* 


cerns the number of absolutely identi- 
fiable or discriminable steps that can be 
obtained with the hue and brightness 
dimensions. Since the entire range of 
visible hue and nearly the entire range of 
brightness was used in these respective 
stimulus series, the number of absolutely 
discriminable steps obtained for these 
dimensions may be considered as close 
approximations to the maximum ob- 
tainable. As was seen above, very few 
values could be selected from the bright- 
ness dimension so as to satisfy the 
criterion of perfect discriminability under 
absolute conditions. The data suggest 
a maximum of about 5 values. The 
maximum number of values that could 
be selected from the hue dimension to 
meet this criterion is somewhat larger. 
Here the data indicate that at least 8 
hues of equal brightness and saturation 
could be so selected. This number for 
hue is somewhat less than the number 
that has been estimated by Halsey and 
Chapanis (8). Their data indicated that 
between 10 and 12 hues could be abso- 
lutely discriminated. 

This discrepancy between the two sets 
of data seems most likely to be due to 
differences in amount of practice given 
Ss. Halsey and Chapanis practiced 
their Ss until there was no further learn- 
ing from trial to trial. If only the last 
20 judgments of each of the 20 hue 
stimuli are considered in the present ex- 
periment, it is found that the Ss were 
transmitting an average of 3.44 bits of 
information from the hue dimension. 
The antilog to the base 2 of this value is 
11, which is quite comparable to the 
figure arrived at by Halsey and Chapanis. 
A comparable computation for the 
brightness series results in an average 


of 2.59 bits which indicates about 6 steps 
on the brightness dimension are abso- 
lutely discriminable. 


SUMMARY 


The present study was concerned with the 
contribution of multidimensional stimulus differ- 
ences to accuracy of discrimination. Discrimi- 
nation accuracy was determined by the method 
of absolute judgment for a series of stimuli 
varying along the dimension of size, of hue, and 
of brightness. The discrimination measures 
obtained for these stimuli varying along these 
single dimensions were then compared with the 
discrimination measures obtained when a series 
of stimuli varied on size and hue, size and bright- 
ness, hue and brightness, and size, hue, and 
brightness. 

The results demonstrated that the discrimi- 
nability for a multidimensional series of stimuli 
was considerably greater than that obtained for 
any of the compounding dimensions used alone. 
It was further shown that the discrimination 
accuracy for a compounded or multidimensional 
series of stimuli could be predicted with rea- 
sonable precision if the discrimination accuracy 
of the compounding dimensions was known. 
This predictability was based upon the assump- 
tion of independence in the accuracy with which 
magnitudes on two or more dimensions are simul- 
taneously judged. 
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CATEGORIES ON LEARNING OF A CONSTANT 
NUMBER OF VISUAL STIMULI? 


HAROLD W. HAKE AND CHARLES W. ERIKSEN 


The Johns Hopkins University 


When Ss are required to discrimi- 
nate among a small set of previously 
unfamiliar visual stimuli, their ability 
to recognize each stimulus as a distinct 
member of the set can be increased by 
practice in verbally labeling the 
stimuli. The effects of such verbal 
training, during which Ss learn to 
identify each stimulus with a unique 
irrelevant label, have been found in 
the enhancement of subsequent at- 
tempts of the same Ss to learn a new 
set of responses to the same stimuli 
(4, 5, 9, 10) or to discriminate among 
these same stimuli by a comparison 
method (1). 

It is reasonable to expect a positive 
transfer effect to follow labeling prac- 
tice because verbal training consti- 
tutes practice in discriminating among 
the stimuli. We should expect a gain 
whether the discriminating responses 
learned were verbal or some other set 
of motor responses. But it does not 
appear that the transfer effect results 
from the fact that Ss are practicing 
the task of discriminating, as such. 
For example, we have no clear indi- 
cation that transfer results because of 
factors suggested by Arnoult, such as 
“warm-up” effects or transfer of 
elements specific to the nature of the 
task or procedures rather than to the 
stimuli (1). We do know, however, 
that the transfer effect following 


! This research was done under Contract No. 
AF 33 (038)-22642, between the Institute for 
Cooperative Research, The Johns Hopkins Uni- 
versity, and the Psychology Branch of the Aero 
Medical Laboratory, Directorate of Research, 
Wright Air Development Center. 
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verbal practice exceeds that which 
follows simply viewing the stimuli an 
equal number of times or even viewing 
the stimuli with the understanding 
that discriminative responses to the 
same stimuli would be required at a 
later time (5). 

The fact that the perceptual gain 
following formal labeling practice ex- 
ceeds that which follows simply view- 
ing the stimuli is explicable from two 
points of view. The first stresses the 
importance of the information which 
is transferred to S on each trial of 
labeling practice. After S identifies 
the stimulus presented on each trial, 
he is routinely informed about the 
correctness of his identification or is 
told the arbitrarily predetermined 
correct label for that stimulus. In 
this way S can avoid confusing the 
stimuli perceptually because he has 
the opportunity to compare his own 
subjective identifications of the stim- 
uli with a set of perfectly accurate 
external criteria. One advantage of 
learning verbal labels for anything 
may be that it allows us to compare 
the distinctions which we make among 
the objects of the world with those 
made by others. 

If the information supplied to S on 
each trial is important in this way, 
then the transfer effect resulting from 
practice should depend upon the 
specificity with which he can identify 
the stimuli during practice. Where 
the number of labels used is a fraction 
of the number of stimuli in the set, 
for example, then S cannot state on 
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each trial which stimulus has ap- 
peared. He can only indicate that 
he perceives the stimulus as one of a 
group sharing the same label. This 
is true also of the information supplied 
on each trial, following the labeling 
attempt, which could indicate only 
the correct general label for the 
stimulus which had appeared. When 
the specificity of labeling is decreased, 
therefore, the transfer effect should 
decrease also because the external 
criteria with which S can compare 
his own identifications of the stimuli 
during practice are more ambiguous. 

The same conclusion can be drawn 
from the second point of view, the 
notion that the critical aspect of 
verbal labeling practice is the acqui- 
sition and retention of the arbitrary 
relations between labels and stimuli 
(4). When S is allowed to use labels 
during verbal practice which are fewer 
in number than the stimuli, we should 
expect the specificity of the label- 
stimulus associations formed during 
practice to decrease and the rein- 
forcement of attention to cues unique 
to each stimulus to be more ambigu- 
ous. And, if distinctive labeling has 
cue-producing properties as suggested 
by Miller (8), then, where a single 
label corresponded to a subgroup of 
stimuli, fewer cues would be available 
for discriminating among the stimuli 
at the end of practice. Experimental 
attempts to support this latter notion 
have produced conflicting evidence 
(3, 9). 

Whatever the mechanism involved, 
it appears clear that if the degree of 
stimulus differentiation achieved dur- 
ing practice is closely related to the 
ability of S to state explicitly which 
stimulus appears on each practice 
trial, then the positive transfer effect 
resulting from practice should be 
decreased by any practice method 
which restricts this ability. In the 
experiment reported here we restricted 


the ability of S to specifically identify 
the stimuli during a practice session. 
This was done by forcing him to use 
a set of responses which was smaller 
in number than the set of stimuli to 
be labeled. He was required to learn 
to apply either 2, 4, or 8 labels to a set 
of 16 previously unknown patterns. 
An orthogonal design permitted an 
evaluation of the degree to which the 
resulting transfer effects were specific 
to the labeling tasks experienced in 
original practice and in a subsequent 
testing situation. In addition, the 
relationship between learning per- 
formances in the two phases of the 
experiment was studied in terms of 
individual Ss within groups and for 
individual stimuli as labeled by indi- 
vidual Ss. 


METHOD 


Apparatus. The visual patterns which Ss 
identified consisted of patterns of lights ap- 
pearing ona 5 X 5 matrix of 25 small neon bulbs. 
The matrix was 7 in. on a side, and Ss faced it 
at a distance of 8 ft. Each of the 16 patterns 
consisted of five lighted bulbs which were se- 
lected at random from the matrix and which had 
no apparent meaning or structure when viewed 
together. The appearance of the patterns was 
controlled by an electronic sequence programmer 
which read the patterns from Teletype tape and 
presented them automatically on the matrix. 
The successive appearances of the patterns were 
spaced 10 sec. apart and each appearance lasted 
2 sec. 

Procedure.—All Ss saw each of the 16 patterns 
20 times in a total of 320 trials. The patterns 
appeared in a partially restricted random order 
which provided for the appearance of all 16 
patterns in the first 16 trials, in the second block 
of 16 trials, etc. During each trial and im- 
mediately following the appearance of the pat- 
tern, Ss were required to call out a label for that 
pattern. Each S had a card in fron: of him on 
which was printed a list of the labels he was 
permitted to use in labeling the patterns. The 
labels consisted of letters of the alphabet chosen 
to have as little sequential connection as possible. 
Immediately after Ss responded with a label on 
each trial, they were told the arbitrarily pre- 
determined correct label for the pattern which 
had appeared on that trial. The task of each 
S was to learn which of the labels from his list 
could be correctly applied to each of the patterns. 
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TABLE 1 


Means Aanp SD’s oF FREQUENCY OF CorRECT IDENTIFICATIONS 
PER Brock or 16 TRIALS 
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The experimental procedure consisted of two 
tasks for all Ss: an original learning task of 160 
trials during which they identified the 16 pat- 
terns using one set of labels, and a transfer task 
during which they identified the same patterns 
using another set of labels. Nine groups of Ss 
were used, each of which used a different set of 
labels in the two tasks. Assignment of sets of 
labels to the nine groups was determined by a 
factorial design in which the 10 Ss in each of the 
nine groups was assigned one of the nine com- 
binations of the three sets of labels which could 
be used in the learning task and the three sets 
of labels which could be used in the transfer task. 
This design provided 30 Ss who used just 2 labels 
in the learning task, 30 Ss who used 4 labels in 
the learning task, and 30 Ss who used 8 labels in 
the learning task. Of the 30 Ss using a par- 
ticular number of labels in the learning task, 10 
later used 2 labels, 10 later used 4 labels, and 10 
later used 8 labels in the transfer task. In all 
cases, including those Ss who used the same 
number of labels in the two phases, the labels 
used in learning were not those used in the 
transfer task. There was no meaningful re- 
lationship between the sets of labels used in the 
two tasks. Wherever possible the stimulus pat- 
terns which had shared the same label during 
the first task were separated and assigned to 
different labels during the second task. 

Subjects ——A total of 90 Ss was used, 10 in 
each of the nine experimental groups. They 
were men undergraduate students who were run 
through the, procedures one at a time. 


RESULTS 


The data of this study are the 
average numbers of correct identifi- 
cations per block of 16 trials made by 


Ss during all trials of the learning and 
transfer tasks. A correct identifi- 
cation was recorded whenever S re- 
sponded to a visual pattern with the 
arbitrarily predetermined correct label 
for that pattern. 

The data were analyzed first to 
determine the effect upon performance 
of the number of labels used by Ss in 
identifying the patterns during origi- 
nal learning. The study of the effect 
of this variable upon performance 
during the transfer task was one of 
the main objectives of the experiment. 

The effect of response restriction upon 
performance.—The average numbers 
of correct identifications per block of 
16 trials averaged over all learning 
trials are presented in the left half of 
Table 1. The same measures of per- 
formance for the transfer task are 
shown in the right half of this table. 
The groups were assigned to the rows 
according to the number of labels used 
in original learning, and average per- 
formance in the rows provides an 
estimate of the effect of this variable 
upon performance during each task. 
The performance of each group of Ss 
in the first task is roughly propor- 
tional to the probability of chance 
success associated with the number of 
labels used in that task, although the 








performances of Ss using 4 and 8 labels 
are more similar than such a simple 
notion would predict. 

Average transfer performance in 
each of the rows of the right half of 
Table 1 measures the effect of the 
number of labels used in original 
learning upon performance during the 
transfer task independent of the 
nature of the transfer task. This 
effect appears to be slight. The Ss 
who used 2 or 8 labels in the learning 
task did slightly better during the 
transfer task than Ss who had used 4 
labels. 

The data of Table 1 were analyzed 
separately for each task by an analysis 
of variance technique after the group 
variances of each task had been tested 
for heterogeneity by Bartlett’s test 
(2) and could be assumed to be homo- 
geneous. Variation among the row 
means of Table 1 for the learning task 
was found to be significantly large 
(P < .001), as expected, because of 
the differences among the possibilities 
of chance successes associated with 
each number of labels. For the same 
reason there was a significant vari- 
ation among the column means of the 
transfer data of Table 1. There were 
no other significant effects. The effect 
of the number of labels used in original 
learning upon performance during the 
transfer task was not reliable, nor was 
there any indication that the numbers 
of labels used in original learning inter- 
acted with the numbers of labels used 
in the transfer task.” 


While analysis of the data provided no evi- 
dence that the nature of experience with the 
patterns during learning differentially affected 
transfer performance, the evaluation of the 


2 The same analysis was performed after cor- 
recting for chance success. This removed all 
sources of significant variation. The effect of 
the number of labels used in learning upon the 
scores of the transfer task, after correction for 
chance, was still an insignificant source of 
variation. 
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groups according to the number of labels used 
originally ignores another original learning vari- 
able which was not constant from group to group. 
This is the number of times Ss were correct in 
using the permissible labels in the first task. 
The different probabilities of chance success 
associated with the various numbers of labels 
used produced a significant variation from group 
to group in the number of correct labeling re- 
sponses achieved in the initial labeling task. In 
effect, we have an unavoidable inverse relation- 
ship in both tasks between the number of labels 
permitted and the number of occasions in which 
the labels were applied correctly to the patterns. 
This relationship prevented a separate analysis 
of the effect of these two factors upon transfer 
performance. 

The correctness or incorrectness of labeling 
responses during the learning task should be a 
critical aspect of S’s experience with the patterns 
if differentiation of the patterns during that task 
resulted from the act of learning a label for each 
pattern. Presumably, the correctness of label- 
ing responses indicates the relative strengths of 
the specific stimulus-response associations built 
up during practice. We should expect to find 
considerable consistency of labeling performance 
in the two tasks if the associations formed during 
learning were directly related to ability to dif- 
ferentiate the patterns. 

The consistency of labeling performance in 
the two tasks of the experiment was studied 
within each of the nine groups of our experiment. 
Two types of relationships were studied. The 
first was a measure of the consistency of per- 
formance of Ss in each group. This was the 
correlation between average performance in the 
learning and transfer tasks of the 10 Ss in each 
group. The second type of relationship meas- 
ured the consistency of performance within- 
subjects. This was the correlation within each 
S between the number of correct responses made 
to each pattern in the learning and the transfer 
tasks. 

We expected that these relationships would 
not be the same within all groups. This ex- 
pectation was based upon the reasonable assump- 
tion that the degree to which Ss using 2 labels 
needed to distinguish among the patterns in 
order to label them correctly during learning 
was less than that required of Ss using 4 or 8 
labels. With only 2 labels the effects of chance 
could contribute as heavily in producing correct 
responses to the patterns as could real ability to 
discriminate among them. The number of cor- 
rect identifications achieved during learning with 
2 labels could be expected, then, to be a less 
important indicator of the extent to which Ss 
could discriminate among the patterns than it 
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would be for Ss using larger numbers of more 
specific labels. 


Relationship between learning and 
transfer performance.—Coefficients of 
correlation were computed from the 
average learning and transfer per- 
formances of the 10 Ss in each of the 
nine groups. These are the within- 
group coefficients presented in Table 
2. Of special interest are the average 
within-group correlations computed 
from the pooled variates of the three 
groups of Ss who used a particular 
number of labels originally, where 
each deviation involved was measured 
from its own group mean (11). These 
coefficients based upon the pooled 


TABLE 2 


CoRRELATION AND REGRESSION 
CoEFFICENTS FOR THE 
Nine Groups 

Within-group coefficients computed from the 
average learning and transfer scores of 10 Ss in 
each group; within-S coefficients computed from 
average learning and transfer scores for indi- 
vidual patterns within Ss. 





























Within-Groups a0 Within-Ss* 
Groups | aaa 
Corre- Regres- Corre Regres- 
lation | sion z lation sion 
2-2 58 62 | 24 19 
2-4 56 .97 16 16 
2-8 44 | .46 20 27 
Pooled 
data** om | £6 | IP | ma 
eis i s&s 33 25 
44 | 39 38 12 ll 
48 | 30 | 59 13 15 
Pooled | 
data** | .30 43 | dP | dé 
8-2 63 .66 .28 19 
84 | 64 | .67 19 | 218 
8-8 | .90 | 1.30 Bw | 
Pooled | 
data** 2 a a ee 18 
| 


} 
*Within-S coefficients are based upon pooled 
variates from the 10 Ss in each group, each deviation 
involved being measured from its own S mean (11). 

* These coefficients are based upon pooled variates 
bene the 30 Ss using the same number of labels in the 
learning session, each deviation being measured from 
its own group mean or own S mean (11). 
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data indicate some tendency for the 
correlation between learning and 
transfer performance to be reduced 
for Ss using the less specific labels 
originally. These differences are not 
statistically reliable, however, and the 
within-group regression can be as- 
sumed the same for all groups. The 
average within-group correlation of 
.57 for all groups properly can be 
taken as the best estimate of the 
relationship within all groups (11) 
Accounting for the average linear 
within-group regression results in a 
33% reduction in the within-group 
variance, which is a_ statistically 
significant reduction (P < .01). 

The significant relationship within 
the groups demonstrates a consistency 
of performance in the twotasks. The 
Ss who did well in learning did well 
in the transfer task also. But the 
implication of this reliable relation- 
ship is not straightforward because it 
could reflect merely the individual 
differences among Ss in ability to 
perform the labeling tasks. To the 
extent that the two labeling tasks 
demanded common skills we could 
expect those Ss who did well in learn- 
ing to do well on the transfer task 
also. This may account for between- 
task correlations reported before in 
the context of verbal pretraining (1). 

More specific information about the 
relationship between transfer per- 
formance and the number of correct 
identifications achieved during learn- 
ing is provided by the within-S re- 
lationships. An evaluation of the 
relationship between performance in 
the two tasks within each S indicated 
the extent to which the patterns that 
S labeled successfully in transfer were 
those that he had labeled most suc- 
cessfully originally. 

The within-S regression and cor- 
relation coefficients are presented in 
Table 2 as average within-S coeffi- 
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cients for each of the nine groups. 
These coefficients are based upon 
pooled values from the 10 Ss in each 
group, each deviation involved being 
measured from its own subject mean 
(11). Analysis of the small differ- 
ences among the nine average within- 
S, within-group regression coefficients 
indicated that they were not sig- 
nificantly large. The average within- 
S correlation for all Ss is .20.* 

Accounting for the average within-S 
regression does significantly reduce the 
transfer task within-subject variation 
(P < .01). The predictive value of 
this relationship is quite low, however, 
as the resulting reduction of within-S 
variation is only 4%. This is de- 
cidedly a very small effect compared 
to that we should expect to find if the 
number of correct identifications were 
the critical variable determining dif- 
ferentiation of the patterns prior to 
the transfer phase. Most of the 
variability which each S showed in 
labeling the 16 patterns during the 
transfer task must be attributed to 
factors other than the relative success 
with which S labeled each pattern 
originally. 


Discussion 


The results are not in accord with our 
notion that the number of labels used 
during initial verbal practice should de- 
termine the specificity or accuracy with 
which Ss learn to recognize each pattern 
during such practice. The use of two 
labels by one-third of our Ss in original 
learning allowed them only to assign ex- 
plicitly the patterns to one of the two 
subsets corresponding to the two labels. 
Their transfer performance, however, 
was at least as good as that of Ss who 


3’ Both the within-group and the within-S 
relationships can be assumed linear in form. 
The addition of quadratic terms to the average 
regression equations did not result in significant 
reductions in the variances of the transfer scores 
under those achieved by fitting linear regression 
lines to the data (11). 
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could explicitly identify the patterns as 
belonging to one of eight subsets during 
the original task. 

Contrary to our expectation also, the 
success with which Ss used the labels in 
the first task, regardless of the nature of 
the labels used, does not appear related 
in a critical way to the perceptual gains 
which resulted from labeling practice. 
The performance of Ss in the transfer 
task was related to performance during 
original learning to a significant degree, 
but we have reason to assume that 
transfer performance would be related 
to the same or greater extent to other 
tests of paired-associate memory or per- 
ceptual speed which did not use our 
stimulus patterns. Attneave has demon- 
strated a negative correlation of .64 be- 
tween scores on two tests of the Airman 
Classification Battery (6 pp. 248-251, 
376-382) and errors in labeling visual 
forms.4 If we choose, we may regard 
performance in the learning task only as 
a measure of some pertinent abilities 
which each S brought to the experiment 
rather than as a measure of the degree to 
which he learned the stimuli during that 
session. 

The Ss undoubtedly learned to differ- 
entiate the stimuli during the first task, 
but we cannot unequivocally attribute 
what they learned to the act of learning 
the correct label for each stimulus. The 
number of times each stimulus pattern 
was correctly paired with its appropriate 
label in the first task, regardless of the 
number of labels used, was not im- 
portantly related to the success with 
which that pattern was discriminated 
from all others in the set and correctly 
labeled in the transfer task. This ap- 
pears to minimize the role that the 
specific stimulus-response associations 
formed in the first task could play in 
determining what is learned about the 
stimuli in that task. 

The results appear to us to emphasize 
the importance of the general labeling 
process rather than factors related to the 


‘Reported in an unpublished manuscript en- 
titled “Symmetry, information and memory for 
patterns.” 
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particular labels used. We may judge 
from our results and from others (5) that 
the perceptual gain resulting from label- 
ing practice appears to occur so long as 
Ss have a decision to make about the 
stimuli on each trial. The labeling task 
given to our Ss seems merely to have 
provided a context which defined an 
objective for them. Whatever the spe- 
cific nature of this objective, it required 
that Ss learn to discriminate among the 
stimuli, and this appears to have been 
accomplished to some degree by all Ss, 
independently of the nature of the label- 
ing task assigned. 


SUMMARY 


The Ss were required to learn to apply either 
2, 4, or 8 irrelevant verbal labels to a set of 16 
similar and previously unfamiliar complex visual 
patterns. Immediately following practice in 
this task, Ss were required to learn to identify 
the patterns, again using a different set of ir- 
relevant verbal labels that were either 2, 4, or 8 
in number. 

The performance of Ss on this transfer task 
was significantly affected by the number of labels 
used on this task but not by the number of labels 
used in the initial learning task. The extent to 
which each S correctly labeled each pattern 
during learning was not indicative to any im- 
portant degree of his success in labeling each 
pattern during the transfer task. 

The performance of each S during learning, 
relative to the other Ss in each group, was indica- 
tive of relative performance within each group 
on the transfer task. It was suggested that this 
relationship may be indicative merely of the 
extent to which both tasks demanded common 
skills. 'The extent to which Ss learned to differ- 
entiate the stimuli during learning was not indi- 
cated unequivocally by measures of performance 
in that task, and the perceptual gain which 
occurred from labeling practice was independent 
of the nature of the labeling task assigned. 
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INFLUENCE OF FRAME SIZE ON APPARENT 
LENGTH OF A LINE 


THEODOR M. KUNNAPAS 


University of Stockholm 


In his classical investigation Rubin 
(3, 4) found that a visual perception 
is normally divided into two fields: a 
figure, which is usually the object of 
attention, and a ground, which oc- 
cupies the rest of the perception. 
This distinction becomes especially 
clear in ambiguous fields as, e.g., in 
the vase-profile figure. He pointed 
out the phenomenal difference be- 
tween the figure and the ground. The 
figure looks like a thing (Ding- 
charakter), whereas the ground ap- 
pears as a substance (Stoffcharakter). 
The figure seems to be nearer to O 
than the ground. The contour be- 
longs to the figure and not to the 
continuous ground. 

The figure-ground problem is de- 
veloped further by gestalt psycholo- 
gists, in the main by Kurt Koffka (2). 
He divided the behavioral environ- 
ment into things and frameworks. 
Koffka pointed out that the figure- 
ground articulation is a part of that 
more general problem, things and 
frameworks. Figures are things, and 
grounds operate like frameworks in 
perception. He says (2, p. 184): “The 
figure depends for its characteristics 
upon the ground on which it appears. 
The ground serves as a framework in 
which the figure is suspended and 
thereby determines the figures.” He 
demonstrated excellently the frame- 
work character of the ground by its 
influence on the shape of the figure. 

In the present investigation an at- 
tempt is made to study how the frame 
size influences the apparent length of 
a line inclosed within a frame. The 
experimental situation is simplified: 
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the line is one of the simplest figures 
as well as the square is one of the 
simplest frames. In this way some 
more exact quantitative formulations 
about the influence under study may 
be expected. 


MeTHOD 


Stimulus materials.—White square pieces of 
cardboard were used as frames. The lengths of 
the sides of these squares were 7, 9, 12, 16, and 
21cm. The length of the constant square was 
7 cm., and the other lengths refer to the variables 
that were compared with it. In the middle of 
each square a line was drawn. The length of 
the line in the constant square was always 50 
cm., while in the variable squares it varied. The 
interval of variation was 1 mm. The width of 
the lines was .4 mm. 

Procedure-—White square pieces were placed 
on the very large sky-blue background, only two 
at a time—the constant square and a variable. 
The variable square was placed either to the 
right (R) or to the left (L) of the constant. The 
variable square was always placed on the back- 
ground so that the line at the middle was at the 
same height as the line in the constant. The 
distance between the two lines was always 13 
cm. Consequently the distance between the two 
squares on the background was variable: the 
largest square was placed 4 cm. from the constant 
square, the other squares were 6.5, 8.5, and 10 
cm. (the smallest square) distant from it. 

The two lines that were compared were situ- 
ated 30° below eye level. The angle of exposure 
of the squares was 60°. The distance between 
S’s eyes and the lines was 110 cm. 

The constant method was employed. After 
preliminary trials, for every size of square a 
limited number of lines were selected which were 
presented to Ss in a prearranged order that was 
unknown to them. ‘The S had to compare the 
length of the exposed two lines in two squares, 
and report whether the variable line was shorter, 
equal to, or longer than the constant line. A 
maximum of 5 sec. was allowed for making the 
judgments. ‘The cases where S could not make 
a definite report during this time were counted 
as “equal.” 
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TABLE 1 


Lenctu or Line (IN MM.) WHICH APPEARS 
EQuaL To THE ConsTANT LINE oF 
50 mm. IN THE CONSTANT 




















SQUARE 
Size of 
Square | PSE Li Ls IU 
(Cm.) 
7x7 | (50.00) | 
9x9 51.33 | 52.53 | 49.69 | 2.84 
12X12 52.68 54.19 51.01 3.18 
16X16 54.16 55.37 52.43 2.94 
21X21 55.63 57.14 | 53.98 3.16 





Each size of the variable squares was used 20 
times on two different days: 10 L, and 10 R 
positions. The sequence of the experiments was 
rotated so as to equate any possible learning and 
fatigue effects. 

Subjects —Ten students, five men and five 
women, took part in the experiments. All Ss 
had normal visual acuity. 


RESULTS 


For each of the four different sizes 
of the square, the point of sub- 
jective equality (PSE) was taken as 
the significant measure of Ss’ re- 
sponses. Also the upper limen (L;) 
and the lower limen (L,), as well as 
the interval of the uncertainty (IU), 
were computed by Spearman’s meth- 
od of the arithmetic mean (1). 

Table 1 gives the general results. 
There is no doubt that the apparent 
length of the line which is in the 
middle of a square-shaped frame de- 
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Fic. 1. Length of the line which appears 
equal to the constant line of 50 mm., plotted 
against the area of the square-shaped frame. 
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Fic. 2. Length of the line which appears 
equal to the constant line of 50 mm., plotted 
against the logarithm of the area of the square- 
shaped frame. 


pends on the size of this square. The 
larger the square, the longer the line 
must be physically if its length is to 
appear equal to that of the constant 
line of 50 mm. in the middle of the 
constant square. Consequently an 
increase in the size of the frame re- 
duces the phenomenal length of the 
line drawn on it, just as if it were 
compressed. 

Figure 1 shows that the physical 
length of the line which appears equal 
to the constant line of 50 mm. does 
not increase rectilinearly. The curve 
is rectilinear, however, if plotted 
against the logarithm of the area of 
the square-shaped frame (see Fig. 2). 
Thus, the phenomenal length of the 
line is a logarithmic function of the 
area of the square enclosing it. 

Figure 3 indicates that the length 
of the line likewise increases recti- 
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Fig. 3. Length of the line which appears 
equal to the constant line of 50 mm., plotted 
against the logarithm of the side of the square- 
shaped frame. 
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linearly, if plotted against the loga- 
rithm of the side of the square en- 
closing it. This means that the 
phenomenal length of the line is also 
a logarithmic function of the side of 
the square-shaped frame. 

In reality these two functions are 
the same if we are dealing with a 
square. According to Fig. 2, L = 
k log A, where L = phenomenal length 
of the line, k = constant, and 4 = 
area of the square. But in a square, 
A = S*, where S = side of the square. 
Substituting S? for 4, we obtain L = 
2k log S, or L = clog S, where ¢ = 
2k. 

Also Ss’ individual responses show 
the same relation. In Fig. 4 the re- 
sponses of three Ss are given: a repre- 
sents one of the best fittings, ) repre- 
sents a typical curve, and ¢ represents 
one of the atypical curves. In c the 
deviations from the rectilinear in- 
crease are greatest. 


Observations made during the course 
of the experiments indicate that in the 
line-frame articulation the frame influ- 
ences the line not only in a horizontal but 
also simultaneously in a vertical direc- 
tion. The frame reduces both the length 
and the width of the line at the same 
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Fic. 4. Samples of individual responses: (a) 
one of the best, (b) a typical curve, and (c) an 
atypical curve. 
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time. The lines, which were of the same 
width (.4 mm.) in all the squares, seemed 
to have different width according to the 
different size of the square-shaped frames. 
In the larger squares the lines appeared 
to be thinner, while in the smaller squares 
they seemed to be thicker, i.e., the influ- 
ence of the frame on the width of the line 
has the same direction as on the length. 
This difference was so noticeable that 
some Ss asked sometimes whether the 
lines really were of the same width. 
Thus we may say that the frame influ- 
ences the enclosed line in all main direc- 
tions at the same time. There is also 
reason to assume that the same influence 
occurs in the more complex figure-frame 
situations. 


SUMMARY 


The present study is concerned with how 
frame size influences the apparent length of a 
line inclosed within the frame. The square, as 
one of the simplest geometrical figures, served as 
frame. Five different sizes of the square were 
employed. The lengths of the lines in the middle 
of these squares were compared with the constant 
line of 50 mm. on the constant square 7 X 7 cm. 

The results indicate that the frame greatly 
influences the apparent length of the line. The 
phenomenal length of the line is a logarithmic 
function of the area and the side of the square- 
shaped frame. Observations made during the 
course of the experiments indicate that in the 
line-frame articulation the frame influences the 
line not only in a horizontal direction but also 
probably in a vertical direction. 
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A COMPARISON OF PSYCHOPHYSICAL METHODS IN THE 
INVESTIGATION OF FOVEAL SIMULTANEOUS 
BRIGHTNESS CONTRAST! 


A. L. DIAMOND, H. SCHEIBLE,? E. SCHWARTZ*, AND R. YOUNG 


Northwestern University 


Investigations of simultaneous 
brightness contrast have shown that 
adjacent visual fields can affect one 
another’s apparent brightness. In 
such investigations the field in which 
the changes in apparent brightness 
are measured is called the test field. 
The adjacent field is termed the in- 
ducing field. 

The technique usually employed to 
measure the effect of the inducing field 
upon the test-field apparent bright- 
ness involves a binocular match (1, 
2,4,5). That is, S is asked to equate 
the apparent brightness of the test 
field, presented to one eye, to the 
apparent brightness of a standard or 
match field, presented to his other eye. 
In this way, when the test-field ap- 
parent brightness changes as a func- 
tion of some variation in the inducing 
field (presented to the same eye as the 
test field) the brightness equality 
match, between match and test fields, 
indicates the extent of this change. 

Further, as in the above-cited 
studies, the psychophysical method 
employed is usually that of adjust- 
ment, as described by Graham (3). 
In this method, S varies either the 
match-field or the test-field luminance 
in order to obtain a brightness equal- 
ity match between the two. The 
fact that S controls the luminance 
changes necessary in the brightness 

1 This work was supported in part by a re- 
search grant. NSF-G923 from the National Sci- 
ence Foundation and in part by the Graduate 
School of Northwestern University. 

? Now at the University of California, Los 
Angeles. 

3 Now at DePaul University, Chicago. 
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match, therefore, enables him to uti- 
lize such motor cues as might arise 
from his manipulation of the appa- 
ratus, in addition to the experimen- 
tally intended visual cues of test- and 
match-field luminances. 

It is conceivable then that certain 
aspects of the contrast phenomenon 
might be peculiar to the particular 
experimental method  used,—the 
method of adjustment in this case. 
The purpose of the present paper is 
to study simultaneous brightness con- 
trast using three different psycho- 
physical methods to determine: (a) 
whether the characteristic effects seen 
in contrast are dependent upon a 
particular investigating method; and 
(b) which method is the most useful 
and reliable in studying contrast. 

This is tested in three separate ex- 
periments, in each of which two psy- 
chophysical methods are compared as 
follows: Exp. I compares the method 
of limits to that of constant stimuli; 
Exp. II compares the method of ad- 
justment to that of constant simuli; 
and Exp. III compares the method of 
limits to that of adjustment. Each 
of these methods is generally described 
by Graham (3). 


METHOD 


General Method 


A pparatus.—The apparatus used in all three 
experiments is identical to that described by 
Diamond (1). It provides as a measure of test- 
field apparent brightness the luminance of a 
match field, which can be set, either by E or S, 
equal to the test-field apparent brightness. The 
test and inducing fields are adjacent squares, 
each 33’ of visual angle on a side, presented to 








S’s right eye, which fixates a 3’ point of light 
located 21’ to the left of the test field. The 
match field, also a 33’ square, is presented to S’s 
left eye which fixates a 3’ point of light located 
21’ to the right of the match field. The S is 
instructed to fuse left- and right-eye fixation 
points so that in the binocular view one fixation 
point is seen between the match field on the left 
and the test field, underneath the inducing field, 
onthe right. The visual angle between the fused 
fixation point and the farthest corner of any field 
is 70’. Stimulation, therefore, is essentially foveal. 

Procedure—The common procedure was as 
follows : Seated in a light-tight cubicle, S initially 
dark adapted for 3 min., then light adapted for 
3 min. to (a) the test field, always at 0.6 log ml.; 
(b) the match field; and (c) the inducing field, 
its luminance being zero at the outset of each 
session. Then, under each of the nine conditions 
of inducing-field luminance (varied in order from 
— «© to 2.71 log ml.) S was given four practice 
matches followed immediately by a set of experi- 
mental matches. There were 10 such experi- 
mental matches per condition under the methods 
of adjustment and limits, and 50 under constant 
stimuli. One minute of dark adaptation and 1 
min. of light adaptation came between the suc- 
cessive inducing-field luminance conditions. In 
each of the three experiments a different E and 
different S were used. 


Experiment I 

The methods compared in this experiment 
were that of limits and constant stimuli. Two 
Ss, AL and DR, were given both methods. AL 
received limits the first session and constant 
stimuli the second, DR took these in reverse 
order. 

Method of limits—To obtain an apparent 
brightness judgment, £ first increased the match- 
field luminance in small steps, from that point at 
which S reported its apparent brightness to be 
definitely less than that of the test field until S 
judged the match field to appear just noticeably 
brighter than the test field. Five increasing and 
five decreasing judgments were made, in counter- 
balanced order (ABBA ) and the mean of 
ten match-field luminances corresponding to 
these judgments was taken as a measure of the 
test-field apparent brightness. 

Method of constant stimuli.—The E first de- 
termined a range of match-field luminances the 
extremes of which appeared to S as just dimmer 
and just brighter than that of the test field. 
Then, in this range he selected five equally 
spaced luminances, setting the match field at 
each luminance value in a random manner until 
he had presented each value 10 times. At each 
individual setting, S reported whether the match 
field appeared brighter or dimmer than the test 
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field. The E then blocked the match-field light 
source and changed its luminance to another 
value. Since this change required a maximum 
of 8 sec., the match field was exposed for 8 sec. 
during each individual judgment in order to keep 
the left eye at a constant level of adaptation. 
The percentage of occurrence of the response 
“brighter” was then graphed for the five match- 
field luminance values, and that luminance which 
coincided on the graph with the 50% point along 
the ordinate (5 out of 10 responses being 
“brighter”) was taken as the average measure 
of the test-field apparent brightness. 


Experiment II 


The methods of adjustment and constant 
stimuli were compared in this experiment. Two 
Ss, SM and BM, were given both methods in 
counterbalanced order. 

Method of adjustment.—In making his bright- 
ness equality judgments, S (rather than £) set 
the luminance of the match field so that it ap- 
peared equal to that of the test field. The E 
then changed the match-field luminance in a 
random direction after each match. The S had 
to adjust the match-field luminance again until 
it seemed equal in apparent brightness to that of 
the test field. The mean of 10 such match-field 
luminance settings was taken as a measure of 
the test-field apparent brightness. 

Method of constant stimuli.—This is identical 
to the same method described above. 


Experiment III 


In this experiment the methods of limits and 
adjustments were compared for three Ss. The 
first two, PC and MG, first had adjustment, then 
limits. The third, PS, took the methods in 
reverse order. These methods are identical to 
the same methods described above. 


RESULTS 


With regard to the general appli- 
cability of the methods tested, one 
result is clear. The method of con- 
stant stimuli is much more time con- 
suming and laborious than the other 
two methods. Experimental sessions 


under constant stimuli took from 90 
to 120 min. to collect a comparable 
amount of data collected in 45 to 60 
min. under the other methods. In 
addition, under constant stimuli, Ss 
complained of eye strain, headaches, 
and fatigue, and Es of fatigue, whereas 
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TABLE 1 
Test-Fietp ApPpARENT Bricutness (B,,) as A FUNCTION oO} 
InpucING-FiELpD LumINANCcE (B;) 





(Data are in log millilamberts) 








Sadesion. Exp. II Exp. III 
Field ——— a oa . : 
yomr | Method of | Method of Method of Method of Method of Method of 
Limits Con. Stim. Adjust. Con. Stim. Limits Adjust. 
O* | 60* 60* | .60* 60* .60* 
—1.51 .70 | .66 | 65 } .64 59 42 
—0.43 66 | 72 Al | 56 | 52 72 
0.06 75 .63 Ry 50 52 | 67 
0.60 69 | 44 | 36 | 430 CO 55 | 62 
1.14 40 .26 .20 32 29 21 
163 | 13, | — 08 07 oO 06} 6-04) 6} | ~—09 
2.16 ~—M | «# | <8 | = | =P | wm 
2.71 — 32 — 4] — .35 — 30 — .60 | —.61 


* Test-field luminance. 


these complaints did not appear under 
the two other methods. There were 
no differences between the methods 
of limits and adjustment in this re- 
spect. In terms of time consumption 
and fatigue, they were equally ef- 
ficient. 

With regard to the actual settings 
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Fic. 1. Test-field apparent brightness (Bm) 
as a function of inducing-field luminance (B;) as 
investigated under three different psychophysical 
methods. The test-field luminance is .60 log 
millilamberts for each curve. 


under each method, these are pre- 
sented in Table 1 and Fig. 1. Since 
there were no systematic differences 
between Ss within each experiment, 
their data were averaged. 

The general shape of the function 
in Fig. 1 is the same for all methods in 
all experiments. ‘Test-field apparent 
brightness is depressed slightly, if at 
all, for inducing-field luminances less 
than that of the test field. For in- 
ducing-field luminances greater than 
that of the test field, test-field ap- 
parent brightness is depressed roughly 
in direct proportion to the luminance 
of the inducing field. This is in agree- 
ment with previous experiments (1, 
4). 

Although the general shape of the 
function remains the same, the extent 
of the depression of the test-field ap- 
parent brightness is slightly different 
for the two methods in Exp. I, i.e., 
the test-field apparent brightness is 
somewhat more depressed under the 
method of constant stimuli than under 
the method of limits. It should be 
pointed out, however, that in Exp. II 
and III, both limits and constant 
stimuli give the same results as ad- 
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justment, so that the discrepancy 
between the curves in Exp. I cannot 
clearly be attributed to differences in 
the methods themselves. 

A comparison can be made between 
the variances of all judgments under 
the methods of adjustment and limits. 
Constant stimuli is not included in 
this comparison since, according to 
Woodworth (6, p. 427), variability 
computed in this method is not di- 
rectly comparable to that computed 
from the other two methods. This 
comparison reveals no difference be- 
tween variances in the methods of 
adjustment and limits. The median 
variance in each method is .02 log ml. 


Discussion 


The results indicate that the inhibitive 
effect seen in contrast experiments (1, 2, 
4, 5) is not peculiar to a particular type 
of methodology used in its investigation. 
The characteristic contrast effect ap- 
peared in all three methods used in the 
present paper. 

It would seem, however, that the 
method of constant stimuli is less useful 
in studying contrast because of the ex- 
cessive time and fatigue involved in 
procedure. In this respect, the methods 
of adjustment and limits are decidedly 
more acceptable since an entire curve can 
be obtained easily, without fatiguing § 
or E, in one experimental session. 

With regard to reliability, both adjust- 
ment and limits are equally acceptable 
in investigating contrast. 


SUMMARY 


Foveal simultaneous brightness contrast was 
investigated with respect to. the psychophysical 
methods generally used in its investigation. The 
apparent brightness of a 33’ test-field square was 
studied as a function of the luminance of a 33’ 
adjacent inducing-field square. A_ binocular 
matching technique was used to measure the 
test-field apparent brightness. Three psycho- 
physical methods of adjustment, limits, and 
constant stimuli were investigated. 

The results indicate no difference in the char- 
acteristic shape of the contrast cugve for the 
different methods. The methods of adjustment 
and limits were equally good in terms of relia- 
bility and convenience, whereas the method of 
constant stimuli was much more fatiguing for 
both S and E and took about twice as much time 
for the same amount of data obtained with the 
other two methods. 
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PSYCHOMOTOR PERFORMANCE AS A FUNCTION OF 
AMOUNT OF TRAINING AND STRESS 


ALFRED CASTANEDA AND DAVID S. PALERMO! 


Iowa Child Welfare Research Station, 
State University of Iowa 


A comparatively recent develop- 
ment is the work concerned with per- 
formance in complex learning tasks as 
a function of stress. A recent’review 
of this work (4) indicates that while a 
variety of methods have been used to 
vary stress, e.g., failure, excessive 
pacing, threat, etc., the common as- 
sumption is that they affect the level 
of motivation. The findings, with 
some exceptions, indicate that per- 
formance under stress is either facili- 
tated or impaired. The theoretical 
and practical importance of studies 
attempting to identify those variables 
determining the nature of the effect is 
generally recognized. 

Another recent development in the 
area of motivation is the work of 
Spence and his colleagues (2, 8, 10, 13) 
concerned with the integration of 
anxiety within Hull’s (3) framework. 
On the assumption that anxiety has 
motivational (drive) properties, cer- 
tain implications have been derived 
regarding performance as a function 
of variations in the motivational level. 
While this work has dealt more spe- 
cifically with anxiety, it has been 
indicated (8) that the more general 
concern has been in the isolation of 
some of the factors which determine 
the quality of performance as a func- 
tion of variations in the motivational 
level. On the assumptions that habit 
and drive jointly determine response 
strength, and that the total effective 

! The authors wish to express their gratitude 
to Mr. D. S. Stewart, Elementary School Cur- 
riculum Coordinator, Iowa City, and Mr. R. 


Hedeman, Superintendent of Schools, West 
Branch, for their cooperation and assistance. 


drive level (D) then operating com- 
bines multiplicatively with each of 
those habit tendencies aroused in a 
given situation, implications have 
been derived concerning the quality 
of performance in complex learning 
tasks, i.e., those involving strong 
competing responses. If the habit 
strengths of the competing incorrect 
responses are greater in relation to the 
correct ones, impairment of perform- 
ance should be expected with an 
increase in drive. However, if the 
correct responses are dominant, i.e., 
their strength is greater in relation to 
the incorrect ones, facilitation should 
be expected. Furthermore, with an 
increase in drive, the degree of im- 
pairment or facilitation should be 
positively related to the relative 
strength of the dominant habit at the 
time of the increase. These impli- 
cations have been confirmed in their 
now familiar series of studies (2, 7, 8, 
9, 10, 11, 12, 13) in which the Taylor 
Anxiety Scale has been employed as 
the empirical variable assumed to 
reflect differences in the drive level of 
human Ss. 

On the assumption that stress, 
varied by emphasizing to S the need 
for speed on the task, affects the level 
of drive (1, 6), the present study is 
concerned with the effects of stress on 
the quality of performance in a com- 
plex learning task. The task is de- 
signed to permit its analysis into 
specific S-R components and to permit 
the specification of the strength of the 
habit mediating each component as 
well as whether the habit is correct or 
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incorrect. This is accomplished in a 
paired-associate type psychomotor 
task on which varying amounts of 
preliminary practice are given where- 
upon the task is to re-pair under stress 
certain of the stimulus and response 
elements while the others are left un- 
changed. According to the assump- 
tions just outlined, the effects of stress 
(D) on the quality of performance 
during the relearning depend on the 
type of S-R pair, i.e., on whether the 
pair has been changed or left un- 
changed. If performance to the 
changed pairs is considered, the effect 
should be deleterious. If performance 
to the unchanged pairs is considered, 
the effect should be facilitation (fewer 
errors). Furthermore, the degree of 
this impairment or facilitation should 
be a positive function of the relative 
strength of the habit mediating a 
given S-R connection. 


METHOD 


Subjects and design—From four different 
schools, 108 fifth-grade Ss were assigned ran- 
domly, in equal numbers, to one of four experi- 
mental groups. The Ss in Groups I (weak 
habit, nonstress) and II (weak habit, stress) 
were given 25 training trials on the preliminary 
task; those in Groups III (strong habit, non- 
stress) and IV (strong habit, stress) 50 training 
trials. All groups were given 50 relearning trials 
at which time Groups II and 1V were adminis- 
tered the stress condition. Ignoring the factor 
of type of S-R pair, the design is a2 X 2 factorial 
in which the interaction between habit strength 
and stress is of primary concern, with the ex- 
pectation of greater differences in performance, 
to both types of S-R pairs, between Groups III 
and IV as compared to Groups I and II. 

Apparatus and procedure-—The apparatus 
consisted of a rectangularly shaped box, 9} X 
18 X 9 in. painted flat black. Centered 3 in. 
above the response panel was a l-in. diameter 
exposure aperture of flashed opal glass. The 
stimuli were five 6.3-v. pilot lamps colored green, 
amber, red, blue, and light red. The coloring 
was done by means of a General Cement Mfg. 
Co. Dial Lite Coloring Kit. With the exception 
of the red which received three dips in the 
coloring solution, all other lights received two 
dips. The lights were housed in a cluster in a 


No. 24 tin can which was coated with aluminum 
paint to increase reflection. The housing was 
fastened inside of the box with the open end 
flush to the exposure aperture. All controls used 
by E were set to the back of the apparatus. By 
means of a holding circuit in which S’s response 
broke the circuit E could actuate any one of the 
five lights and set any single push button to 
break the circuit by appropriate adjustments of 
two rotary selector switches. Hence S could 
deactivate the light only by selecting the button 
designated correct for that light. Self-corrected 
trials were employed with a trial defined as a 
single presentation of a stimulus. Of the five 
different light-button pairs used, three were 
changed for the relearning trials while two re- 
mained the same as in the training trials. With 
the restriction that no single light appeared 
twice in succession, the order of appearance of 
the lights was determined randomly within 
blocks of five trials for the first 25 trials. Where 
the procedure called for 50 trials, this sequence 
was repeated for the next 25 trials. 

The instructions were designed to indicate to 
S that he was required to associate which push 
button turned off a given colored light. For the 
relearning S was informed that “some” of the 
light-button pairs had been changed and that 
“some” remained the same as in the training 
trials. For Ss in the stress condition, additional 
instructions were given to the effect that speed 
was the measure of their learning ability, and a 
Standard Electric timer was connected into the 
apparatus and placed so that its face was clearly 
visible to him. The circuit was such that the 
timer was actuated simultaneously with the on- 
set of a light and remained on until S made the 
correct response which automatically broke the 
circuit to both the timer and the light. Incorrect 
responses affected neither the timer nor the light. 
For all groups the relearning trials were adminis- 
tered upon completion of the training trials. 
Every response made to each light, and the order 
in which it was made, was recorded for each S. 


RESULTS 


Error scores of the four groups for 
the first 25 training trials were sub- 
jected to a simple analysis of variance 
to test the hypothesis that they were 
random samples from the same popu- 
lation. For 3 and 104 df, the obtained 
F of 1.12 did not permit rejection of 
the null hypothesis. 

According to the present assump- 
tions, if the dominant habit is incor- 
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TABLE 1 


Errors To CHANGED AND UncHancep S-R Pairs on RELEARNING TRIALS 1-20 
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rect, the consequences of an increase 
in drive should be deleterious, but 
facilitation would be expected if the 
dominant habit is correct. Augmen- 
tation of either of these effects would 
be expected with increases in the 
strength of the dominant habit over 
those competing. Table 1 presents 
the data relevant to these expecta- 
tions based on the first 20 relearning 
trials. It can be seen that in the case 
of the changed S-R pairs (dominant 
habit incorrect) the general effect of 
an increase in drive is deleterious with 
the tendency for this effect to be 
augmented by increases in the 
strength of the habit. However, in 
the case of the unchanged S-R pairs 
(dominant habit correct) the effect of 
an increase in drive for the strong 
habit group is in the direction of 
facilitation. 

A summary of an analysis of vari- 
ance (Lindquist Type III [5] ) based 


2 Although the procedure called for 50 re- 
learning trials, justification lies within the pres- 
ent theory for the expectation that the differ- 
ences expected would be maximal in the early 
trials. Analysis based on the 50 trials disclosed 
that a number of Ss in the stress condition had 
learned the task. Analysis based on the first 10 
trials revealed that not only were all differences 
in the expected direction but that, in comparison 
to the differences based on 20 trials they were in 
most cases appreciably greater. However, the 
scores based on the first 10 trials were too vari- 


able to be reliable. 


on these data is presented in Table 2. 
It can be seen that the over-all dif- 
ferences due to stress are significant 
as are the differences due to the type 
of S-R pair, this latter undoubtedly 
reflecting differences due to the rela- 
tively fewer number of presentations 
of the unchanged pairs over the first 
20 trials. Of particular theoretical 
interest, however, is the significant 
interaction between stress, amount of 
training and the type of S-R pair. 
This may be interpreted as indicating 
that the effects of stress with varying 
amounts of training are not the same 
for the two types of S-R pairs. This 
is in accord with the data presented 
in Table 1, i.e., an increase or decrease 
in errors due to stress depends on the 
type of S-R pair considered. 


TABLE 2 


SumMMARY OF ANALYSIS OF VARIANCE OF 
Error Scores Basep on 











Triats 1-20 

Source df | MS | F P 
Training 1 19.56, — 
Stress 1 | 370.78} 10.96 | <.005 
Training X stress | 1 77.04; 2.28) <.20 
Error (b) 104| 33.82) 
S-R Type 1 |6133.34 |333.44| <.001 
S-R Type X 


_ 


training 04; — 
S-R Type X stress} 1 | 408.38) 22.20) <.001 
S-R Type X 
training X stress} 1 | 190.28) 10.37) <.005 
Error(w) 104; 18.39 
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TABLE 3 


Summary oF ANALYSIS OF VARIANCE OF 
Errors To CuHancep S-R Pairs on 
RELEARNING TRIALS 1-20 























Source df MS F p 
Training 1 6.08; — 
Stress 1 | 778.71} 22.31 | <.001 
Training X stress | 1 | 529.77} 15.18 | <.001 
Error (w) | 104 | 34.91 
| 





Since all F’s based on scores to the 
unchanged pairs were less than unity, 
indicating the lack of reliability of the 
differences shown in this category in 
Table 1, only the results of the analy- 
sis of variance for the changed pairs 
are presented in Table 3. These re- 
sults indicate that the main effect of 
stress was significant while that of 
training was not. However, it is the 
significant interaction between these 
two variables which is of theoretical 
importance. Further analysis indi- 
cated that this interaction was due 
mainly to the differences between 
Groups III and IV (P < .001, F = 
28.35, df = 1 and 52) as compared to 
the difference between Groups I and 
II (P < .20, F = 1.73, df = 1 and 
52). These results are in accord with 
the assumption that these two vari- 
ables, habit strength and drive, com- 
bine multiplicatively in the deter- 
mination of response strength. Other 
analyses indicated that Group III 
performed better than Group I (P < 
05, F = 5.33, df = 1 and 52) and 
that Group II performed better, but 
not significantly better, than Group 
IV (P < .20, F = 2.00, df = 1 and 
52). 


Discussion 


The results of this study tend to be in 
agreement with the implications of the 
assumption relating habit strength and 
drive to response strength. With one 
exception the differences, while not all 
significant, were in the expected direc- 
tion. 


Though these results tend to be in 
accord with the theory, a number of al- 
ternative hypotheses should be con- 
sidered. Since the stréss condition re- 
quired S§ to alter his pace, it could be 
assumed that its main effect was to dis- 
rupt a pacing set with a resulting increase 
in the number of errors. A second ex- 
planation could be based on the assump- 
tion that the stress condition served as a 
distracting element serving to impair 
performance as reflected in increases in 
the number of errors. Still, 4 third al- 
ternative explanation might be based on 
the assumption that the stress instruc- 
tions which emphasized speed may have 
resulted in a set to respond rapidly with 
a disregard for accuracy. On each of 
these assumptions, however, it would 
seem plausible to expect impairment of 
performance to both the changed and 
unchanged S-R pairs. It is clear from 
Table 1 that the effects of stress are 
neither uniformly impairing nor facili- 
tating, but depend on whether the domi- 
nant habit is correct or incorrect. 

In conclusion, these results accord 
some support to the contention that 
stress, as manipulated in this study, 
exhibits some of the properties of drive. 
Additionally, support is given to the 
specific manner in which the variables of 
habit strength and drive are assumed to 
be related in the determination of re- 
sponse strength. The particular manner 
in which the dominant habit was varied 
as to whether it was correct or incorrect 
was such that, on the basis of these as- 
sumptions, both facilitation and impair- 
ment on the same task could be expected 
as a consequence of an increase in the 
drive level. To the extent that these 
expectations were realized, support is 
given to the notion that predictions re- 
garding the quality of performance as a 
function of variations in the motivational 
level (D) requires knowledge of the 
strengths of the specific habits affected 
by such variations and whether the 
dominant habit is correct or incorrect. 


SUMMARY 


Performance as a function of the amount of 
training and stress was studied on the assump- 
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tion that stress, under the conditions of this 
study, affects the level of drive. Impairment of 
performance was predicted as a result of increases 
in drive where the dominant habit was incorrect 
and facilitation where it was correct. Also, it 
was predicted that the degree of impairment or 
facilitation under such conditions would be a 
positive function of the relative strength of the 
dominant habit. 

Of four groups of 27 Ss each, two were given 
25 training trials on a preliminary task, and two 
were given 50 trials. All groups were given 50 re- 
learning trials in which three of the stimulus and 
response elements had to be re-paired while the 
remaining two were left unchanged. One group 
from each of the two training groups relearned 
under stress. Stress was found to increase sig- 
nificantly the number of errors to the changed 
S-R pairs, with this increase being more pro- 
nounced with increased training on these pairs. 
A tendency, though unreliable, was found for 
stress to decrease the number of errors to the two 
unchanged pairs. On the basis of these results, 
it was concluded that predictions regarding the 
effects on performance of variations in the drive 
level requires knowledge of the strengths of the 
specific habits involved and whether the domi- 
nant habit is correct or incorrect. 


REFERENCES 


1. Atper, T. G. Memory for completed and 
incompleted tasks as a function of person- 
ality; an analysis of group data. /. 
abnorm. soc. Psychol., 1946, 41, 403-421. 

2. Farser, I. E., & Spence, K. W. Complex 
learning and conditioning as a function of 
anxiety. J. exp. Psychol., 1954, 45, 120- 
125. 

3. Hutt, C. L. Principles of behavior. 
York: D. Appleton-Century, 1943. 


New 


4. Lazarus, R. S., Degse, J., & Oster, S. F. 
The effects of psychological stress upon 
performance. : Psychol. Bull., 1952, 49, 
293-317. 

5. Linpquist, E. F. Design and analysis of 
experiments in psychology and education. 
New York: Houghton Mifflin, 1953. 

6. Linpstey, D. B. A study of performance 
under speed stress. OSRD, 1945, Publ. Bd. 
No. 18381. Washington, D. C.: U. S&S. 
Dept. of Commerce, 1946. 

7. Montacue, E. K. The role of anxiety in 
serial rote learning. J. exp. Psychol, 
1953, 45, 91-96. 

8. Spence, K. W. Current interpretations of 
learning data and some recent develop- 
ments in stimulus-response theory. In 
Kentucky Symposium, Learning theory, 
personality theory and clinical research. 
New York: Wiley, 1954, Ch. 1. 

9. Spence, K. W., & Farper, I. E. Condi- 
tioning and extinction as a function of 
anxiety. J. exp. Psychol., 1953, 45, 116- 
119. 

10. Spence, K. W., & Farser, I. E. The rela- 
tion of anxiety to differential eyelid con- 
ditioning. J. exp. Psychol., 1954, 47, 
127-134. 

11. Spence, K. W., & Taytor, J. A. Anxiety 
and strength of the UCS as determiners of 
the amount of eyelid conditioning. /. 
exp. Psychol., 1951, 42, 183-188. 

12. Taytor, J. A. The relationship of anxiety 
to the conditioned eyelid response. /. 
exp. Psychol., 1951, 41, 91-92. 

13. Taytor, J. A., & Spence, K.W. The rela- 
tionship of anxiety level to performance 
in serial learning. J. exp. Psychol., 1952, 
44, 61-64. 


(Received October 8, 1954) 





"ves eee OF 





Journal of Experimental Psychology 
Vol. 50, No. 3, 1955 
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STIMULUS PREDIFFERENTIATION AND 
PERFORMANCE SET! . 


JOAN H. CANTOR 
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Recent research has indicated a 
growing interest in the effects of 
various kinds of preliminary verbal 
training upon the later acquisition of 
skill in performing a motor task. A 
number of investigators (1, 2, 3, 5, 6, 
7, 8, 10) have studied the effects of 
pretraining in which S gains knowl- 
edge about the motor task stimuli. 
This pretraining has generally con- 
sisted of verbal paired-associates learn- 
ing in which the stimuli are the same 
as, or substitutes for, those of the 
motor task, while the verbal responses 
are unrelated to the required motor 
responses. Increased discriminability 
among the motor task stimuli, as a 
result of the pretraining, has been 
referred to either as stimulus pre- 
differentiation or acquired distinctive- 
ness of cues. Several studies using 
stimulus predifferentiation training 
have demonstrated positive transfer 
to the motor task (1, 2, 5, 7, 8), but 
two investigators have found evidence 
of an interfering effect (1, 10). 

Superior motor performance has 
also been shown to be a result of the 
transfer of nonspecific factors follow- 
ing paired-associates learning in which 
both the stimuli and responses are 
unrelated to those of the motor task 
(6). Because of these effects of “‘per- 


1 This article is based on a dissertation sub- 
mitted to the Graduate College of the State 
University of Iowa in partial fulfillment of the 
requirements for the Ph.D. degree. The writer 
is indebted to Dr. Don Lewis for advice and as- 
sistance throughout the course of the investi- 
gation. 

2 Now at George Peabody College for Teach- 
ers, Nashville, Tenn. 


formance set,” an “irrelevant” * pre- 
training group is necessary as a control 
in the investigation of stimulus pre- 
differentiation. ’ 

In a systematic investigation of 
stimulus predifferentiation and per- 
formance set, it is important to deter- 
mine the relationship between amount 
of pretraining and subsequent motor 
performance. 


Gagné and Baker (7) gave three amounts of 
relevant pretraining prior to a motor task re- 
quiring the operation of a particular toggle 
switch for each of four colored stimulus lights. 
The group receiving the largest amount of pre- 
training performed significantly better on the 
motor task than did a group receiving none. 
However, this facilitation could be attributed to 
the effects of performance set as well as to stimu- 
lus predifferentiation. 

Baldwin (1) gave two amounts of relevant 
pretraining prior to performance on a simple 
discrimination task. For the larger amount, Ss 
given relevant pretraining made significantly 
more correct responses on the motor task than 
did a group given the same amount of irrelevant 
pretraining. 


The purpose of the present study 
was to investigate the effects of three 
amounts of relevant pretraining upon 
the subsequent performance of a per- 
ceptual-motor task. The study was 
also designed to obtain information 
concerning the influence of the same 


3 The terminology used is the same as that 
suggested by McAllister (10). The word “ir- 
relevant” refers to the specific stimulus and 
response elements in the paired-associates task 
and does not imply that all aspects of the pre- 
training task are unrelated to the motor task. 
Stimulus predifferentiation training has been 
called “relevant stimulus” pretraining and is here 
abbreviated to “relevant”’ pretraining. 
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amounts of irrelevant 
upon motor performance. 

The Star Discrimeter provided the 
criterion task in which S learned a 
particular motor response for each of 
six similar colored lights. The S 
moved a wobble stick into one of six 
horizontal channels when the corres- 
ponding color was presented. During 
relevant pretraining, Ss learned to as- 
sociate a letter with each of these 
colors, and those given irrelevant pre- 
training associated the same letters 
with a different set of colors. 

One of the earliest explicit state- 
ments of the mechanism underlying 
stimulus predifferentiation is that of 
Miller and Dollard (11, p. 73). Ap- 
plying their formulation to the present 
situation, it is assumed that the verbal 
responses learned during relevant pre- 
training transfer to the motor task. 
Presumably, these verbal responses 
have stimulus components, and each 
light stimulus therefore has a constant 
additional cue which serves to dis- 
tinguish it from the other stimuli in 
the group. The result is decreased 
generalization among the colors and 
faster learning of the motor task. 
With increased amounts of relevant 
pretraining, the probability that the 
correct verbal responses will transfer 
to the motor task should increase, 
resulting in increased discrimination 
and thus greater facilitation. 

Performance set involves the trans- 
fer of less specific responses and in- 
cludes such factors as warm-up and 
learning-to-learn. Therefore, Ss given 
irrelevant pretraining may show mo- 
tor performance superior to that of Ss 
given none. 

On the basis of theory and available 
data, certain predictions were made at 
the outset of the experiment: (a) Rele- 
vant pretraining would result in posi- 
tive transfer to the motor task for all 
three amounts of pretraining. Spe- 
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Fic. 1. Basic elements of the Star 


Discrimeter. 


cifically, it was believed that the rele- 
vant groups would show performance 
superior to that of the irrelevant 
groups. (b) The amount of positive 
transfer to the motor task would in- 
crease with increasing amounts of 
relevant pretraining. (c) Irrelevant 
pretraining would result in positive 
transfer to the motor task for all three 
amounts of pretraining. All irrele- 
vant groups were expected to show 
better motor performance than that 
of a no pretraining group.‘ 


METHOD. 


Apparatus and procedure.—The motor task 
was provided by the Star Discrimeter, the Iowa 
version of a device described by Duncan (4). 
As seen in Fig. 1, six channels are cut into a 
horizontal steel plate so that they radiate out at 
60° angles from a center opening. A steel wobble 
stick protrudes from the center opening and may 
be moved freely into any one of the channels. 
Mounted at about eye level behind and slightly 
to the left of the wobble stick is a black stimulus 
box. A circular piece of opal glass is placed in 
the opening near the top edge of the box. Any 
one of six colors may be diffused onto the glass 
from inside the box. A 50-point stepping relay 
is used to present a continuous random sequence 
of the six colors. 

The colors consisted of red, orange, and yellow 
hues. Although it was not possible to obtain a 
completely continuous series of hues, each color 


4 No prediction was made regarding the rela- 
tive effects of larger amounts of irrelevant pre- 
training upon performance set. The writer 
knows of no empirical evidence concerning these 
effects in a situation using pretraining with a 
motor task. 
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was quite similar to at least one other color in the 
series. 

The S stood in front of the apparatus, holding 
the wobble stick with the preferred hand. The 
task was to learn which of the six colors was 
associated with each channel and to move the 
wobble stick into as many correct channels as 
possible during each trial period. The S con- 
tinued moving the stick into channels until the 
correct channel was entered, resulting in the ap- 
pearance of a new color. Microswitches placed 
in the channels activated correct response, shal- 
low, and deep error counters. 

Pretraining was given in the same location as 
motor training. The colors were presented in 
the manner described above, and, for the rele- 
vant groups, were identical to those of the motor 
task. For the irrelevant groups, a series of blue- 
green hues was used. The two sets of colors had 
been roughly equated for discriminability during 
preliminary work. 

An automatic slide projector placed directly 
in front of the stimulus box was used to present 
the letters. A series of timers provided auto- 
matic synchronization of the stepping relay on 
the Star and the slide projector. Each color in 
the sequence was presented for 4 sec. A blank 
slide was presented during the first 2 sec. fol- 
lowed by a slide containing the letter associated 
with that color during the last 2 sec. The letter 
appeared directly underneath the color. The 
procedure was therefore exactly like paired- 
associates learning. A random sequence of 36 
colors was used for each block of stimuli, every 
color being presented six times. 

The group without pretraining received in- 
structions and immediately began practice on 
the Star Discrimeter. The procedures were the 
same for all pretraining groups except that the 
relevant groups associated letters with the red- 
yellow hues, and the irrelevant groups with the 
blue-green hues. The instructions explained 
that S was to learn to discriminate similar colors 
by associating a particular letter with each of the 
colors and pronouncing the correct letter before 
it was shown. The pretraining groups were 
given 2, 4, or 12 blocks of stimulus presentations 
with the appropriate colors. Two-minute rests 
were given after every three blocks. At the end 
of pretraining, S left the room for a 2-min. rest 
while E set up the motor task. The projector 
was rolled away and the wobble stick was un- 
covered for the first time. For the irrelevant 
groups, the colors were changed. There were 40 
trials on the Star Discrimeter, each trial lasting 
20 sec., with 10-sec. intertrial intervals. A 2- 
min. rest was given after 20 trials. 

Subjects—Women volunteers from a course 
in introductory psychology were used as Ss, and 
20 women were randomly assigned to each of the 
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GROUPS OF FOUR TRIALS 


Fic. 2. Means per trial in blocks of four 
trials on the Star Discrimeter for the combined 
relevant group and the combined irrelevant 
group. 


seven groups. All Ss were required to read the 
numbers correctly on Cards 1-9 and 12-13 of the 
Ishihara color test. Seventeen Ss were dis- 
carded because of color blindness, apparatus 
breakdown, or failure to follow instructions. 


RESULTS 


Pretraining performance.—For each 
amount of pretraining, no significant 
difference was found between the rele- 
vant and irrelevant groups in either 
total number of correct responses or 
in the number of correct responses on 
the last pretraining block.5 There- 
fore, differences in motor performance 
could not reasonably be attributed to 
possible differences in the difficulty of 
the pretraining task. 

Motor performance.—Means per 
trial of correct responses and of total 
errors for the three relevant groups 
combined and the three irrelevant 
groups combined are plotted against 
blocks of four trials in Fig. 2. (Total 
error scores were the sum of deep and 


5 Whenever possible, the assumption of homo- 
geneity of variance was tested and was found to 
be tenable in all cases. A 5% coefficient of risk 
was adopted for all statistical analyses. 
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NUMBER OF PRE-TRAINING BLOCKS 
Fic. 3. General means per trial over Trials 


1-40 on the Star Discrimeter for each pretraining 
group. 


shallow errors.) General means per 
trial over Trials 1-40 are presented in 
Fig. 3. 

The correct response and error data 
for all 40 trials were analyzed for the 
six pretraining groups according to 
Lindquist’s (9) Type III design, the 
variables being type of pretraining, 
amount of pretraining, and trials. 
The results were essentially alike for 
the two measures of performance. 
The difference between the relevant 
and irrelevant groups was significant 
for both correct responses (F = 5.96, 
df = 1, 114) and total errors (F = 
4.91, df =1, 114). No significant 
differences were found for amount of 
pretraining, nor were the interactions 
between type and amount of pre- 
training significant. The trials effect 
was highly significant, as indicated by 
the curves in Fig. 2. The interaction 
between type of pretraining and trials 
was significant for both correct re- 
sponses (F = 1.79, df = 39, 4446) and 
total errors (F = 1.94, df = 39, 4446), 
showing that the trends in Fig. 2 are 
not parallel over all 40 trials. 

No significant differences in motor 
performance were found among the 
irrelevant and no pretraining groups. 


Discussion 


The data support the first prediction 
that all three amounts of relevant pre- 
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training would have a facilitative effect. 
As seen in Fig. 2, the superiority of the 
combined relevant group was greatest 
during the middle trials, leading to the 
significant interaction between trials and 
type of pretraining. The point in learn- 
ing at which facilitation should begin to 
be evidenced is a difficult one to deter- 
mine. Decreased generalization among 
the stimuli is not expected to be facili- 
tating until at least some of the correct 
associative connections are established 
in the motor task. Before any correct 
associations are formed, the identification 
of the individual colors should not result 
in more correct responses or fewer errors. 
However, as soon as some learning 
occurs, increased discriminability of the 
stimuli should allow for faster elimina- 
tion of incorrect responses. The present 
data showed an increase in the superi- 
ority of the relevant group followed by a 
decrease as the performance asymptote 
was approached. Presumably, the dif- 
ferences would have completely disap- 
peared had practice been continued. 

Contrary to the second prediction, the 
amount of facilitation did not increase 
with increased amounts of relevant pre- 
training. The expectation was that the 
relevant-irrelevant trend lines (see Fig. 
3) for correct responses and for total 
errors would diverge, the effect being 
reflected by significant interactions be- 
tween type and amount of pretraining. 
However, the hypotheses of parallel 
trends could not be rejected. The facili- 
tation for four pretraining blocks was 
considerably greater than that for two 
blocks, as predicted. However, the 
largest amount of relevant pretraining 
resulted in relatively little positive 
transfer. 

Although it now appears clear that 
stimulus predifferentiation can be shown 
to be facilitative, the variables deter- 
mining the amount of facilitation are 
difficult to specify. Since the facilitation 
depends upon the transfer to the motor 
task of responses learned during pre- 
training, amount of pretraining should be 
an important variable. The results sug- 
gest that the functional relationship be- 
tween amount of pretraining and amount 
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of facilitation is not a simple one, and 
that other relevant variables must be 
identified before the nature of the func- 
tion can be determined. 

No verification of the third prediction 
was found. The motor performance of 
the irrelevant and no pretraining groups 
was essentially the same. Apparently 
any transfer of nonspecific factors to the 
motor task which occurred did not facili- 
tate motor performance. Further sys- 
tematic studies of the relevant variables 
are needed to explain the differences in 
experimental findings regarding per- 
formance set. 


SUMMARY 


The purpose of this study was to investigate 
the effects of three amounts of verbal pretraining 
upon performance of a perceptual-motor task. 
In the motor task provided by the Star Discrim- 
eter, Ss learned to move a wobble stick into 
channels corresponding to six colored lights con- 
sisting of red-yellow hues. Three groups of 20 
women received either 2, 4, or 12 blocks of rele- 
vant pretraining in which they learned to as- 
sociate a letter with each of the six colors. Three 
irrelevant pretraining groups were given corre- 
sponding amounts of practice in associating 
letters with a blue-green series of colors. An 
additional group had no pretraining prior to the 
motor task. All Ss were given 40 trials on the 
Star Discrimeter. The performance measures 
were correct responses and total errors per 20- 
sec. trial. 

The predictions made and the results ob- 
tained were as follows: 

1. The data supported the prediction that all 
three amounts of relevant pretraining would 
result in positive transfer to the motor task. 
The performance of the combined relevant 
groups was reliably superior to that of the com- 
bined irrelevant groups. This facilitation was 
attributed to decreased generalization among the 
stimuli of the motor task. 

2. The prediction that amount of facilitation 
resulting from stimulus predifferentiation would 
increase with increasing amounts of pretraining 
was not verified. 
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3. The prediction that irrelevant pretraining 
would facilitate motor performance was not 
verified. 
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TRANSPOSITION IN THE WHITE RAT AS A FUNCTION 
OF STIMULUS COMPARISON ! 


ROBERT THOMPSON 


Louisiana State University 


Recent work in discriminative learn- 
ing has revealed a number of dif- 
ficulties for the two main interpre- 
tations of transposition behavior. 
The theory held by Kohler (4) and 
Lashley (5) which assumes that S 
responds to the relationship existing 
between the stimuli suffers from the 
findings that this cue can be experi- 
mentally abolished without disturbing 
the discriminative habit (2, 3, 11). 
In these studies, the habit was ap- 
parently maintained on the basis of 
the absolute properties of the stimu- 
lus objects. Similarly, the alternate 
theory of Spence (9, 10) proposing 
that S responds to the specific aspects 
of the stimulus situation receives little 
support from the results obtained by 
Nissen, Levinson, and Nickols (7), 
and by Saldanha and Bitterman (8). 
These investigators showed that re- 
lational cues seem to operate in the 
mastery of certain discrimination 
problems. 

In the light of this conflicting evi- 
dence, it is not unreasonable to assume 
that S learns to respond to both the 
relative and absolute properties of the 
stimulus situation. Nissen et al. (7) 
came to a similar conclusion in ex- 
plaining the performance of chim- 
panzees on a complex discrimination.” 


1This paper represents a portion of a dis- 
sertation submitted to the Faculty of the Grad- 
uate School of the University of Texas in partial 
fulfillment of the requirements for the Ph.D. 
degree. The author is indebted to Dr. H. C. 
Blodgett for advice during the course of this 
study. 

2The majority of relational theorists have 
mentioned the existence of response to the 
absolute properties (4, 5), but relegate it to an 
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This interpretation is also applicable 
to transposition behavior, e.g., the 
degree of transposition being a func- 
tion of the interaction between re- 
lational and specific learning. Previ- 
ous studies have been primarily 
interested, however, in the effect of 
only one variable on transposition 
performance, and such data are in- 
adequate to evaluate the necessity of 
positing response to two different cues. 

The aim of the present experiment 
was to study the effects of three vari- 
ables on transposition performance. 
The degree of comparison of training 
stimuli appears to be an important 
variable as shown by Baker and 
Lawrence (1). These investigators, 
using a Grice apparatus, failed to 
demonstrate transposition in rats 
whose training did not afford them 
direct comparison of the training 
stimuli. Although these results sup- 
port a relational learning theory, the 
authors admit that they are not 
crucial because of the difficulties in- 
herent in the apparatus employed. 
Weiss and Stevenson (12) recently 
investigated the effect of delayed 
testing on transposition in adult 
human Ss. They found that in- 
creasing amounts of time between 
attainment of criterion and the test 
tended to increase the degree of trans- 
position. The third variable about 
which the present study was con- 
cerned dealt with the degree of dif- 
ficulty of the training problem, a 


insignificant role in view of the fact that memory 
of relations shows more permanence and stabil- 
ity than memory of absolute properties. 
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variable which has not received syste- 
matic study in animals. 


METHOD 


Subjects and Apparatus 

Sixty-four experimentally naive albino rats, 
32 males and 32 females, were used as Ss. Their 
ages ranged from 60 to 90 days at the beginning 
of the experiment. All Ss were handled for 
approximately three weeks prior to the beginning 
of experimental training. During the last week, 
each S was placed on a 23-hr. feeding schedule. 

A conventional two-window Lashley jumping 
apparatus was employed, each window meas- 
uring 4.5 in. square. Two feet below the 
windows was a cloth net into which the S fell 
upon making an incorrect response. Behind the 
windows was a feeding platform where a constant 
supply of wet mash was provided throughout the 
experiment. The entire apparatus was painted 
a flat black. 


Procedure 


Preliminary training.—The Ss were fed on the 
food platform for a period of one week, during 
which time they became accustomed to the ap- 
paratus through exploration. At the end of this 
period Ss were trained for five days to jump 
through open windows. At first the jumping 
platform was moved in so that the gap between 
it and the windows was small, enabling S to 
walk through the open windows. Gradually 
the gap was increased to a maximal distance of 
9 in. Later all Ss were trained to jump through 
vertical black- and white-striped cards (stri- 
ations .5 in. in width) which were placed un- 
locked against the windows. During the entire 
preliminary training period Ss were given guided 
trials, if necessary, to prevent the development 
of a consistent preference for either of the two 
windows. 

Training pairs.—Following preliminary train- 
ing, Ss were divided at random into two groups 


TABLE 1 


ARRANGEMENT OF Pairs FOR THE COMPARISON 
AnD NoncompaRison Groups * 








Comparison Group Noncomparison Group 





es _ + sa 
Light gray | Dark gray | Light gray |Large circle 
ve ss pie 


Small circle| Large decent circle} Dark gray 





* Circles were white outlines } in. wide painted on 
black backgrounds. 
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TABLE 2 


StimuLus VALUES FoR THE Carps UsEep 
IN THE BRIGHTNESS AND SIZE 














DiscRIMINATIONS 
Brightness Size 
Discrimination Discrimination 
(Per Cent (Outside 
Problem Reflectance) Diameter, In.) 
L+ D —- c+ c- 
Easy 68 25 2.50 4.25 
Difficult 68 43 2.50 : 3.50 

















of 32 Ss each. Each group was trained con- 
currently on two separate pairs of stimulus cards, 
the members of one pair differing only in re- 
flectance, the members of the second pair 
differing only in size of circle outlines. One 
group (Comparison Group) received training on 
the two pairs which afforded them direct com- 
parison between cards in the same dimension. 
That is, the members of the brightness pair or 
the members of the size pair appeared together 
in the two windows. The second group (Non- 
comparison Group) was trained on the two 
pairs so that direct comparison of cards in the 
same dimension was impossible. In this case, 
appearing in the two windows were either one 
member of the size pair and one member of the 
brightness pair, or the other two members of 
each pair. The technique employed was the 
same as that used by Saldanha and Bitterman 
(8). Table 1 illustrates the training pairs for 
the Comparison and Noncomparison Groups. 

Half of the Ss in each main group received 
training on an “easy” problem, the other half 
received training on a “difficult” problem. The 
problems were made to vary in difficulty by 
decreasing or increasing the intrapair difference 
in reflectance for the brightness pair and in 
diameter for the size pair. Table 2 gives the 
stimulus values in reflectance and the sizes of 
circle diameter for the two pairs involved in the 
easy and difficult problems. 

A total of 12 trials was given each day, 6 trials 
on each pair. The pairs were presented in a 
prearranged random order. The positive card 
of each pair appeared on the right for three trials 
and on the left for the other three trials. A 
response to the positive card of either pair ad- 
mitted S to the rear platform, whereas a response 
to the negative card, which was locked, caused S 
to fall into the net. The method of correction 
was employed throughout, each trial therefore 
terminating in a correct response. If S failed 
to give a correct response in three “free” jumps, 
S was manually guided to the positive card. 
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Training was continued until S made no more 
than two errors in two consecutive days (22 
errorless responses in 24 trials with no more than 
one error on each pair). 

Transposition test—Upon mastering the 
training pairs, the transposition test was given. 
Two different time intervals between the mastery 
of the training pairs and the first 12 trials on the 
transpositional stimulus cards were employed. 
Half of the Ss in each subgroup were tested im- 
mediately upon reaching criterion. The re- 
maining half were tested 24 hr. after meeting 
criterion. 

The transposition test, which was given in the 
positive direction, consisted of the pairing of a 
lighter stimulus card (85% reflectance) with the 
previously positive light card (68% reflectance), 
and the pairing of a smaller circle (1.75 in. 
outside diameter) with the previously positive 
small circle (2.5 in. outside diameter). ‘The 
positive card of each pair during the transpo- 
sition test was the lighter stimulus card and the 
smaller circle. The stimulus cards were paired 
so that there would be direct comparison be- 
tween cards on the same dimensions. 

The experimental procedure used in the 
transposition test was the same as that used in 
the training period. The brightness pair and 
the size pair were learned concurrently. Twelve 
trials were given each day with six trials on each 
pair. The two pairs were presented in random 
order, the positive card of each pair appearing 
on the left for half of the trials. Training was 
terminated when S committed no more than two 
errors in two consecutive days. 


RESULTS 


Original learning.—Before discuss- 
ing the results of the transposition 
test, it is necessary to examine the 
performance of the Comparison and 
Noncomparison Groups on the easy 
and difficult problems in the original 
learning. Table 3 gives this in- 
formation in terms of days to criterion 
and total errors. These data show 
that the manipulations employed to 
provide two training problems dif- 
fering in difficulty were successful. 
For both the Comparison and Non- 
comparison Groups, the easy problem 
required significantly fewer days to 
master (t’s = 5.36 and 5.64; P’s = 
001) and resulted in significantly 
fewer errors (t’s = 5.18 and 5.09; 


TABLE 3 


PERFORMANCE Scores ON TRAINING ProBLEMS 
FOR COMPARISON AND 
NoNCOMPARISON 
Groups 








Days to Criterion Total Errors 





Problem | N 





Mean SD Mean SD 














Comparison Group 





Easy 16 6.0 1.52 49.2 | 15.31 
Difficult | 16} 10.5 2.91 81.8 | 19.76 








Noncomparison Group 





Easy 16 6.2 1.93 47.5 | 19.54 
Difficult |16| 11.5 3.16 89.0 | 25.98 




















P’s = .001) than the difficult prob- 
lem. 

More importantly, there were es- 
sentially no differences between the 


TABLE 4 
PercentaGE Correct RESPONSES ON First 
TRIAL AND ON First Day on THE 
BRIGHTNESS ANB SIZE 











DIscRIMINATIONS 
First Trial First Day 
Condition N Bright- Bright- 


Size ness Size ness 
Pair Pair Pair Pair 








Comparison ite la” iwie 
Non- 
comparison 32| 63 | 72* 62 | 72** 











Problem 
Easy 32| 66 | 72* 66 | 75** 
Difficult 32| 69* | 75** | 66 | 80*** 

















Time of Testing 




















Immediate 32166 | 69* 64 | 74** 
Delayed 32 | 69* | 78*** | 68* | 81*** 
*P = OS 
*P = O1 
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comparison and noncomparison pro- 
cedures, a result contrary to that 
obtained by Saldanha and Bitterman 
(8). The differences between the 
Comparison and Noncomparison 
Groups on the easy problem were 
very unreliable. On the difficult prob- 
lem, the Comparison Group tended 
to be superior to the Noncomparison 
Group in terms of both days to cri- 
terion and total errors, but the re- 
sulting P values fell short of the .05 
level of significance. 

Trans position test.—One of the sig- 
nificant indices of transposition is 
performance on the first test trial with 
the new pair of stimuli. In the 
present study the use of two different 
training pairs therefore provided two 
such test trials—one on the size pair 
and one on the brightness pair. 
Table 4 presents the percentage of 
correct responses on the first test trial 
under each major experimental con- 
dition. The P values were calculated 
by the ¢ test for proportions. It can 
be seen that there was significant 
transposition under each condition on 
the brightness pair. There was also 
significant transposition on the size 
pair, although the significance levels 
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are marginal for the noncomparison 
procedure, the easy problem, and the 
immediate testing conditions. 

In terms of combined performance 
on both pairs for the first trial, there 
was no significant difference in trans- 
position between the comparison and 
the noncomparison conditions (P = 
.50). Similarly, no significant differ- 
ence in the number of relative choices 
occurred between the easy and dif- 
ficult problems (P = .70), or between 
the immediate and delayed testing 
conditions (P = .50). 

Table 4 also provides the percentage 
of correct responses on the first day 
in terms of initial jumps on the six 
test trials with each pair. With this 
second index of transposition, it can 
be seen that a significant. number of 
relative choices was obtained under 
all experimental conditions on the 
brightness pair. On the size pair, 
however, only the comparison and 
delayed testing conditions resulted in 
significant transposition, whereas the 
levels of significance for the other ex- 
perimental conditions fell short of 
statistical significance. 

The differences between the related 
conditions (comparison versus non- 


TABLE 5 


ANALYSES OF VARIANCE OF ToTaL Errors on Frist Day, Days To CRITERION, 
AND ToTaL Errors To CRITERION 






































Total Errors on Days to Total Errors to 
First Day Criterion Criterion 
Source of Variation df 
M M M 
Square P Square F Square F 
Modes of presentation (P) 1 51.0 7.61** 85.2 18.89*** | 1507.1 20.67*** 
Degrees of difficulty (D) 1 1.2 19.3 4.30* 302.3 ‘4.09* 
Time of testing (T) 1 16.0 2.40 4.9 1.09 74.6 1.01 
PxXD 1 2.2 1.6 16.3 
aes 1 3.1 1.1 41.8 
TXD 1 28.3 ao 2.6 166.3 2.25 
Lame 1 10.4 65 3.1 71.3 
Within 56 6.67 4.5 73.9 
Total 63 
*P = 0S. 
+P = 01. 
P= 001. 
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comparison procedures, easy versus 
dificult problems, and immediate 
versus delayed testing conditions) in 
terms of combined correct responses 
on the first day did not approach the 
05 level of significance. 

Table 5 summarizes the analyses of 
variance for total errors on the first 
day, total errors to criterion, and trials 
to criterion. For the three analyses, 
the eight groups are of homogeneous 
variance as shown by Bartlett’s test. 
On the basis of these measures of 
performance, the comparative pro- 
cedure was superior to the noncom- 
parative procedure in learning the 
transposition problem. The signifi- 
cant F’s of 4.30 and 4.09 for days to 
criterion and total errors to criterion, 
respectively, show that the Ss trained 
on the difficult problem performed 
more efficiently on the transposition 
pairs than the Ss trained on the easy 
problem. Of particular interest is 
the significant interaction existing 
between time of testing and problem 
difficulty for total errors on the first 
day of the transposition test. In- 
spection of the data reveals that, for 
the easy problem, delayed testing 
shows more efficient performance than 
immediate testing. That is, the ef- 
fectiveness of the delayed testing 
condition is limited to the easy 
problem. 


Discussion 


One of the principal findings of this 
experiment was the reliable evidence 
of transposition in the Noncomparison 
Group. The Ss in this group exhibited 
a very significant degree of transposition 
on brightness in terms of performance 
on the first trial as well as in terms of 
performance on first day. On size, 
however, only a marginal degree of 
transposition was found. The fact that 
the noncomparison Ss tended to respond 
relationally to the transpositional stimu- 
lus cards stands in contradistinction to 
the results reported by Baker and 


Lawrence (1). These investigators, using 
a Grice discrimination apparatus, found 
that the successively trained group (Non- 
comparison Group) not only failed to 
show transposition, but exhibited a slight 
tendency to respond to the absolute 
characteristics of the test situation. 

At least two differences in procedure 
between the present study and that of 
Baker and Lawrence may be responsible 
for the conflicting findings. In the first 
place, the successive discrimination prob- 
lem used during the training period re- 
quired that only one of the two discrimi- 
nanda appear on each trial. This one- 
choice situation used by Baker and 
Lawrence contrasts sharply with the 
two-choice situation employed for the 
noncomparison Ss in the present study. 
It is quite possible, as Baker and Law- 
rence admit, that successively trained 
Ss may acquire different receptor orien- 
tation habits in the one-choice situation 
than in the two-choice situation. Sec- 
ondly, the test for transposition in the 
Baker and Lawrence study required 
switching the successively trained Ss to 
a two-choice simultaneous situation, 
whereas the present study preserved the 
two-choice situation throughout both 
phases of the experiment. Thus there 
remains the possibility that the different 
receptor orientation responses, as well as 
the switching from the one-choice to the 
two-choice situation, may be responsible 
for the absence of transposition in the 
successively trained group employed by 
Baker and Lawrence. 

The appearance of transposition ex- 
hibited under the noncomparative con- 
dition is consistent with the theory of 
Spence (9,10). It might seem, however, 
that these data oppose a relational learn- 
ing theory (4, 5), since direct comparison 
of the training stimuli on the same con- 
tinuum was impossible. The relation- 
ship between the stimuli may have been 
built up under the noncomparative con- 
dition if S compared sensory impressions 
with previous traces. This interpre- 
tation would hold only when the inter- 
trial interval is brief. The poorer trans- 
position learning scores on the test 
problem for the noncomparative Ss may 
be handled by both theories on the basis 
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of negative transfer of receptor orien- 
tation responses. 

There was a tendency, although in- 
significant, for the Ss trained on the 
difficult problem to exhibit more trans- 
position in terms of first-trial and first- 
day performance than the Ss trained on 
the easy problem. In terms of trans- 
position learning measures, this difference 
became significant. The degree of trans- 
position as a function of the difficulty of 
the training problem has not received 
attention by either of the two main 
theories. This variable should be taken 
into account when a generalized theory 
of discrimination learning and _ trans- 
position is developed. 

Although the present data do not 
provide critical evidence against either 
the relational or the specific stimulus 
theories, they can perhaps best be as- 
similated in a theory which acknowledges 
the occurrence of both absolute and re- 
lational leaning. Such a theory would 
suggest a line of further experimentation 
and would emphasize interactions among 
variables which have, heretofore, been 
neglected. 


SUMMARY 


This experiment was designed to study the 
effects of comparison of training stimuli, the 
degree of difficulty of the training pairs, and the 
time of testing on transpositional behavior. 

Sixty-four albino rats were given training on 
a jumping stand to discriminate between two 
discrete pairs of stimulus cards, the members of 
one pair differing only in reflectance, and the 
members of the other pair differing only in size 
of circle outline. Thirty-two Ss received train- 
ing which afforded them direct comparison of 
stimuli on the same dimension. The remaining 
32 Ss were trained so that direct comparison of 
the stimulus objects in the same dimension was 
impossible. Half of the Ss in each group was 
trained on “easy” pairs; the other half was 
trained on “difficult” pairs. Test for transpo- 
sition was given either immediately or delayed 
for 24 hr. 

The following results were obtained: 


1. Significant transposition was obtained 


under both the comparative and noncomparative 
conditions in terms of first trial and first jump 
on the six trials on the first day. 

2. In terms of transposition learning” scores, 
the Comparison Group yielded more transpo- 
sition than the Noncomparison Group (P =.01). 


ROBERT THOMPSON 


3. In terms of total errors on first day, there 
was a significant interaction between degree of 
difficulty and time of testing (P = .05). Only 
on the easy problem was delayed testing superior 
to immediate testing. 

4. The difficult problem yielded more trans- 
position than the easy problem with respect to 
days to criterion (P = .05) and total errors to 
criterion (P = .05) in transposition learning. 
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MOVEMENT ERROR, PRESSURE VARIATION, AND 
THE RANGE EFFECT! 


BERNARD WEISS? 


University of Rochester 


A motor response that cannot be 
guided by external (visual) feedback 
may be guided by internal (proprio- 
ceptive) feedback. In an earlier study 
(8), proprioceptive feedback was sys- 
tematically varied by varying the 
maximum pressure and displacement 
needed to perform a stick positioning 
task. The task required adjustments 
to be made after the original visual 
cues were withdrawn. Two sets of 
pressure-displacement conditions were 
used to obtain the variation in pres- 
sure and displacement. In one set of 
conditions the maximum pressure re- 
quired on the control stick was kept 
constant at 30 lb. and the maximum 
displacement was varied, taking on 
values of 3, 6, 12, 18, 24, and 30 
degrees. In the other set, maximum 
displacement was kept constant at 
30° and maximum pressure was varied 
from 0 to 30 lb., in 6-lb. steps. Pres- 
sure variation produced no significant 
effect on constant error (CE) and 
variability in the set of conditions in 
which displacement remained con- 
stant and pressure was varied. When, 
on the other hand, pressure remained 
constant and displacement was varied, 
CE and relative variability decreased 
as the maximum displacement in- 
creased. The experimental design, 
however, did not permit a definite 
function to be assigned to pressure 


1'This study was conducted as part of a 
research program on human motor performance 
supported by a contract between the Special 
Devices Center, Office of Naval Research, and 
the University of Rochester, with Dr. S. D. S. 
Spragg as responsible investigator. 

2Now at the USAF School of Aviation 
Medicine, Randolph Field, Texas. 
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cues in the positioning settings made 
with the smaller displacement ranges 
(less than 7.5°) since maximum pres- 
sure for these ranges was constant. 

The earlier study also showed that 
the differences among these conditions 
mainly arose from responses to the 
shortest displacement within a con- 
dition. This result was attributed 
partly to the “range effect,” a phe- 
nomenon observed by many psy- 
chologists, and investigated in motor 
performance by Ellson and Wheeler 
(2), Ellson and Coppock (3), and Slack 
(6), among others. The range effect 
describes a tendency for Ss to over- 
estimate a particular displacement 
when it is the shortest in a range of 
stimuli, and to underestimate it when 
it is the longest. ; 

The present study, therefore, had 
two purposes: first, a more precise 
investigation of the role played by 
pressure variation in positioning set- 
tings with displacement ranges less 
then 7.5°; second, some assessment 
of the extent to which the range effect 
had produced the results of the prev- 
ious study (8). 


METHOD 


Four conditions were used. In each con- 
dition the maximum displacement required was 
6°. The different conditions were distinguished 
by their maximum pressures. ‘These four values 
were 0, 7.5, 15, and 301b. Pressure and distance 
were linearly related. 

The S’s task, as in the previous study, was to 
compensate for the displacement of a spot of 
light from the center of an oscilloscope screen. 
The spot flashed on for .6 sec., then was with- 
drawn for 2.5 sec. During the “blackout” 
period, S moved a 20-in. stick control the dis- 
tance and in the direction (either fore or aft) 
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TABLE 1 


Constant Error 1n DEGREES FOR THE MiIxEep AND ConsTANT SERIES 
AS A Function oF ANGLE AND PRESSURE 








Constant Series 








Mixed Series 

Mean 

Angle ] (Both 
ub | SRS | SAE | GHP | mean | ak. | GES | AE | GRP | meen | 

Ls” 038; .122| .211 | —.015 .089j} .241 .377| = .307| «413 334 212 
i -153| .158] .094 .001 -102}) 011 .040; .024} .130) .051 .076 
4.5° 170} .085| .063 |} —.110} .052!) —.132 | —.341| —.211| —.135| —.205| —.076 
6° — .077 | —.025| .088 | —.030) —.011 | —.071 | —.142 | —.342 | —.450| —.251| —.131 





Mean 071} .085| .114 | —.038| .058 
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that he expected would return the spot to the 
center of the screen. At the end of this period, 
the spot reappeared for 2 sec. to give S an oppor- 
tunity to correct his error. After a 5-sec. 
“blackout” interval, another displacement ap- 
peared on the screen. A more detailed descrip- 
tion of the apparatus and task is given elsewhere 
(8, p. 216). 

There were four possible displacements in 
each direction (push and pull). These corres- 
ponded to 1.5, 3, 4.5 and 6 degrees, or 25, 50, 
75, and 100% of the maximum distance. 

Subjects—All 8 Ss were graduate students or 
staff members in the psychology department of 
the University of Rochester and ranged in age. 
from 23 to 32 years. None of them had served 
as Ss in the previous study. 

Procedure.—To obtain a measure of the in- 
fluence of the range effect, two series of displace- 
ments were used. In the mixed series, the 
sequence of presented displacements was es- 
sentially random, as was that used in the prev- 
ious study (8). In the constant series, all the 
displacements in a sequence were the same size. 

Four sessions were devoted to the mixed and 
four to the constant series for each S. Thus, 
each S performed on each pressure-displacement 
condition under two different methods, making 
a total of eight experimental sessions per S. 

On the mixed series, a trial consisted of 32 
displacements. Since there were 8 possible 
displacements (4 in each direction), each trial 
consisted of 4 settings for each displacement. 
Each session was made up of 4 trials, so that 
during a single session S made 16 settings for 
each of the 8 possible displacements. 

Since it could be expected that practice on a 
series of displacements of constant size would 
tend to eliminate the range effect (3), it was 
possible, by comparing performance on the mixed 
with performance on the constant series, to 
obtain an estimate of how much the range effect 
had contributed to the earlier results. 

To keep the amount of practice on a particular 


size of displacement for a particular condition 
equivalent for both series, the length of each 
sequence in the constant series was chosen to be 
16 settings. Order effects were counterbalanced 
by a latin-square design. 

As in the previous experiment, the Ss received 
preliminary training to acquaint them with the 
apparatus and task. At the beginning of each 
session, S made one setting at each of the 8 
displacements, which were flashed on manually 
by £. For the rest of the session, the displace- 
ments were presented automatically by a selec- 
tive sequence programmer and timer. The 
presented displacement and S’s response were 
recorded by a Brush BL-202 inkwriter oscillo- 
graph. 

Each S performed on all conditions with both 
series. The effects of sequence of conditions 
were controlled by counterbalancing. 


RESULTS 


For.the mixed series, the results to 
be analyzed are derived from the last 
trial of each session. Since each trial 


TABLE 2 


ANALYSIS OF VARIANCE OF THE CONSTANT 
Error Data 























Source af — F 
Pressure | 3 | 7,356.78 53 
Angle 3 192,119.87 13.71* 
Series l 45,375.78 3.26 
PXA 9 6,730.70 48 
PxsS 3 13,738.95 99 
AXS 3 109,132.03 7.85* 
PXAxXS 9 14,011.75 

Total 31 | 

*P < 01. 
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TABLE 3 


Weser Ratio ror THE Mrixep aNp ConsTANT SERIES AS A 
Function or ANGLE AND PRESSURE 
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Constant Series Mixed Series 
Mean 
Angle FN sor 
7. 5 i 0-15 ries 

ub. | ‘Lb. | Le. | To. | Mean | ih. | Tb> | TES | Tee | Mean 

3" 158 | .207 | .226 139 182 | .267 | .351 | .349 | .352 | .330 256 
: ig 183 132 | .097 | .083 124 | .171 | .195 | .198 | .172 | .184 .154 
4.5° 124 123 | .078 093 104 | .188 | .172 | .185 | .147 | .173 .138 
6° 101 | .087 | .102 | .091 095 | .129 | .159 | .135 | .142 | .141 118 
Mean 142 | .137 | .126 | .102 | .126 | .189 | .219 217 | .203 | .207 .166 






































contained 4 settings for each of the 8 
displacements, every displacement 
within a condition was defined by 
data from 32 settings or 4 by each S. 
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Fic. 1. Constant error in degrees for the 
mixed and constant series as a function of angle. 


The last 4 settings of each sequence 
in the constant series were selected to 
be compared to the comparable set- 
tings on the last trial of the mixed 
series, since the latter comprised the 
last 4 of the 16 settings that were made 
during a session on each of the 8 
displacements. 

The results are described in two 
ways: constant error and Weber ratio, 
the latter being computed as the ratio 
of the SD to the size of the required 
displacement. The data for push and 
pull are combined since the differences 
between the directions were not sig- 
nificant. 

Constant errors.—The CE’s in de- 
grees for both series are given in 
Table 1. The significance of the 


observed differences was assessed by 


TABLE 4 


ANALYsIS OF VARIANCE OF THE WEBER 
Fraction DaTa 


























Source df FP -- F 
Pressure 3 964.75 1.62 
Angles 3 30,106.42 50.61* 
Series 1 51,842.00 87.15* 
PXA 9 945.50 1.59 
Pxs 3 1,113.08 1.87 
AxsS 3 4,133.25 6.95** 
PxAxS 9 594.89 

Total 31 

* < Ol 

=P < .02 
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Fic. 2. Weber ratios for the mixed and 
constant series as a function of angle. 








an analysis of variance, shown in 
Table 2. This reveals that the differ- 
ences among pressures and series are 
not significant by the usual criteria. 
The differences among angles are 
significant. As can be seen in Fig. 1, 
the significance is mainly derived from 
differences on the mixed series, which 
shows a prominent “range effect.” 
This statement is supported by the 
significant Angle X Series interaction, 
which also means that the slopes of 
the two curves are significantly dif- 
ferent. 

Weber ratios.—Table 3 gives the 
Weber ratios for the two series and 
Table 4 shows the results of the 
analysis of variance. Again, pres- 
sure differences are not significant, 
although a slight trend appears in 
the data for the constant series. 
Angles and Series, however, did pro- 
duce significant variation. Figure 2 


shows that except for the initial rate 
of decline from 1.5 to 3 degrees, the 
Weber ratios for the two series are 
proportional. The F* ratio for the 
Angle X Series interaction, however, 
indicates that these initial slopes are 
significantly different. 


DiscussIOoN 


Three conclusions seem to follow from 
the results of this and the earlier study 


(8): 


1. When a positioning task is used in 
which visual feedback is delayed, pres- 
sure variation, over a wide range of 
pressure and displacement values, will 
not significantly alter CE or the precision 
of adjustment as defined by the Weber 
ratio or relative variability. 

2. A mixed sequence of displacements, 
as might be expected, results in less 
precise discrimination than a constant 
series. But, also, it tends to produce its 
greatest effect at the shortest displace- 
ment in a range, making the loss of 
precision here proportionately greater 
(see Fig. 2). The relative variability 
data of the earlier study (8) also suggest 
that the shorter the ranze of movement, 
the greater the initial slope. That is, 
shortening the range of movement tends 
to exaggerate the departure from the 
horizontal. 

3. Similarly, there seems to be a 
tendency for the range effect, as it ap- 
pears in this task (on the mixed series), 
to be skewed toward the shorter dis- 
placements in a range. That is, for a 
single range of movement, the CE in 
degrees will tend to be higher at the 
shortest displacement than at the longest. 
A rough index of this skewness may be 
secured by computing the ratio of the 
CE at the shortest displacement to the 
CE at the longest. For the earlier 
study, the mean ratio for the six con- 
ditions in which displacement varied is 
1.37. In the present study it is 1.33. 
Also, the ratio appears to change with the 
size of the range, growing larger as the 
range becomes smaller, and thus dis- 
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playing a trend similar to that found for 
precision. 


Possible explanations for the apparent 
ineffectiveness of pressure variation as a 
cue in this kind of task were discussed in 
the earlier paper (8) and seem applicable 
here. Two alternatives were offered. 
One was that distance discrimination 
may be so much finer than pressure 
discrimination that most of the pressure 
information is redundant. The other 
was that distance information may be 
remembered more easily and niore pre- 
cisely than pressure information, since 
it seems easier to conceptualize and pro- 
vides a greater variety of cues. 

The finding that CE on a constant 
series of misalignments is less than on a 
mixed series is hardly an occasion for 
surprise. Other investigators have re- 
ported similar results when using tasks 
like that used here (1, 3). The interest 
it holds arises from the difference in the 
pattern of error, which was reflected in 
the significant Angle X Series _inter- 
action and in the ratio of CE at the 
shortest to CE at the longest displace- 
ment. Perhaps the phenomenon taking 
place might be called a “nonlinear range 
effect.”” Other data also suggest that 
with mixed series such an effect may be 
obtained. Slack’s calculations (6), for 
instance, reveal that the responses to 
the smallest displacement in his task 
are more heavily skewed toward the 
mean of the input distribution than 
responses to the larger ones. 

Models to account for the range effect, 
which Hollingworth called the law of 
central tendency (4), have been proposed 
by Johnson (5) and by Slack (7). 
Johnson views this phenomenon as an- 
other manifestation of the regression 
toward the mean that occurs when two 
variables are imperfectly correlated—the 
variables in this instance consisting of 
stimulus magnitudes and response mag- 
nitudes. Slack’s model proposes that 
the error on any trial will be a function 
of a weighted average of the differences 
among succeeding magnitudes of the 
earlier stimuli. Both of these models 
assume linearity of the effect. 


The Weber fraction values provide an 
interesting corollary to the CE data. 
They show, not only that a mixed series 
produces greater dispersion than a con- 
stant series, but that, as with CE, the 
pattern, or distribution of variable error, 
i.e., Weber ratio, is altered. Perhaps 
more important is the fact that for both 
series the Weber fraction shows a pro- 
gressive decline with angle. That this 
is not a phenomenon restricted to one 
particular range of movement may be 
seen by examining the relative variability 
data of the earlier experiment (8), which 
indicate that the effect is more prominent 
with the shorter ranges of movement. 
It appears, therefore, to be a phenomenon 
different from the decrease in precision 
at low-intensity values often associated 
with the Weber functions of various 
sensory modalities. Perhaps it is not 
too unreasonable, in fact, to suspect that 
part of the latter findings may be due to 
this ‘‘skewed dispersion effect.”” In any 
event, it provides one more complication 
for models of psychophysical judgment 
to account for. 


SUMMARY 


The S’s task in this study was to compensate 
for the displacement of a spot of light from the 
center of an oscilloscope screen. He did this by 
moving a control stick during a period when the 
spot was not visible. To determine the effect 
of pressure variation on the CE and precision 
(Weber ratio) of positioning responses at small 
distances, four pressure-displacement conditions 
were used: a maximum displacement of 6° with 
maximum pressures of 0, 7.5, 15, and 30 Ib. 
Eight Ss were used, each serving under all 
conditions. The displacements were presented 
in two orders: a mixed (random) series and a 
constant series. 

For both series, pressure variation had no 
significant effect on CE and the Weber fraction. 
The pattern of error and dispersion for the two 
series differed, however. In the mixed series, 
the range effect was quite prominent as compared 
to the constant series. Also, the CE at the 


shortest displacements, when the data of an 
earlier experiment are included, seems to be 
greater than the CE at the longest, thus indi- 
cating nonlinearity of the range effect. This 
nonlinearity seems to be exaggerated as the 
range of movement is shortened. The Weber 
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ratios for the two series seem to be proportional 
except for the initial decline as the extent of 
movement increases, the mixed series resulting 
in a steeper fall. Both series show a greater 
Weber ratio at the shorter displacements, and 
along with the data of the earlier study suggest 
that this departure from linearity is magnified 
by compressing the range of movement. 
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GENERALIZED EXTINCTION AND SECONDARY 
REINFORCEMENT IN VISUAL DISCRIMINATION 
LEARNING WITH DELAYED REWARD 


G. ROBERT GRICE AND HERBERT M. GOLDMAN 


University of Illinois 


In a study of the role of secondary 
reinforcement in delayed reward learn- 
ing, Grice (1) has found that white- 
black discrimination learning is facili- 
tated by the use of goal boxes of the 
same color as the alleys comprising 
the choice stimuli. In an experi- 
mental group, animals entered either 
a black or white stimulus alley, and, 
after a 5-sec. delay in a gray com- 
partment, entered an end box of the 
same color as the stimulus chosen. 
The reward was received in the white 
end box. For the control group, both 
end boxes were gray, the reward being 
placed in the gray box following 
choices of the white alley. The 
superiority of the experimental group 
was attributed to the generalization 
of secondary reinforcement from the 
white goal box to the white stimulus 
alley. This was assumed to provide 
a basis for immediate reward following 
the correct choice which was not 
present for the control group. This 
interpretation neglects, however, the 
possible effects on learning of the fact 
that incorrect choices always led to 
an empty end box of the same color 
as the negative stimulus alley. It is 
quite possible that inhibitory effects 
of the empty end box may generalize 
and provide a basis for rapid extinc- 
tion of approach responses to the 
negative stimulus alley. This pos- 
sibility is readily deduced from S-R 
interpretations of discrimination 
learning which emphasize the role of 
inhibitory factors (4, 6), and is also 
suggested by experimental findings 
indicating their importance (2, 3, 5). 


The purpose of the present experi- 
ment is to evaluate the relative con- 
tributions of generalized secondary 
reinforcement and inhibition at the 
choice point resulting from the ma- 
nipulation of end-box experiences. 


PROCEDURE 


General design—The experiment involved 
four groups trained on a white-black discrimi- 
nation problem with 2-sec. delay following choice 
of either the correct or incorrect alley. The 
independent variable was the color of the end 
boxes following either choice. As in the earlier 
study (1), all S’s were trained with white posi- 
tive. For the control group (GG) both of the 
end boxes were gray, the food being placed in 
the box following the white stimulus alley. For 
a second group (WB), there was a white goal box 
and a black, negative end box. This condition 
corresponds to that of the original experiment 
and superiority over the control condition may 
be attributed either to generalized secondary 
reinforcement or inhibition or both. A third 
group (WG) had a white goal box and a gray, 
negative end box. This condition presumably 
involves the secondary reinforcement effect but 
not the inhibition effect. The fourth group 
(GB) had a gray goal box and a black, negative 
end box. Superiority of this group over the 
control should be due to the generalization of 
extinction but should not involve the secondary 
reinforcement effect. 

Apparatus.—The apparatus was the black 
and white alley, delayed reward discrimination 
apparatus previously described by Grice (1). 
There were certain minor differences in the 
present form of the apparatus: (a) The floor of 
the V-shaped choice chamber was gray rather 
than extensions of the black and white alleys. 
(b) The color of the end compartments was 
varied by the use of sliding three-alley sections 
similar to the stimulus alleys rather than by the 
use of inserts. (c) Lighting was from bulbs 
placed over the stimulus alleys and over the end 
boxes. The average brightnesses of the white 
stimulus alley and the white goal boxes were 
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TABLE 1 
TRIALS AND Errors TO CRITERION OF LEARNING 
Errors Trials 
Group 
Median Mean Range Median Mean Range 

GG 128.5 Mg 100-492* 305 . 240-880* 
WB 43 38.9 12-59 105 103 30-190 
WG 58 61.5 27-93 165 158 80-230 
GB 40 40.6 11-79 110 106 30-220 























* Means are indeterminate since one S in the GG group failed to learn in 880 trials and was discontinued. 


approximately 25.6 and 23.6 apparent ft.- 
candles, respectively. The brightness of the 
black stimulus alley was about 1.3 apparent ft.- 
candles, and that of the black end box was 1.1. 
The gray goal boxes were about 5.0 apparent 
ft.-candles. Each of these values is a mean of 
10 readings with a Macbeth illuminometer. 
(d) The 2-sec. delay employed with all groups 
in the present experiment was achieved by 
delaying Ss for 2 sec. in the 18-in. delay com- 
partments rather than by allowing them to run 
through 6-ft. alleys as in the previous experiment. 
Timing was done with a stop watch. 

Subjects—The S’s were 20 male and 20 
female, experimentally naive, albino rats of 
Sprague-Dawley strain. Their ages ranged from 
73 to 98 days at the beginning of the experiment. 
The S’s were randomly assigned to the groups 
with the restrictions that each group contain the 
same number of males and females and that 
litter mates be divided equally among the groups 
whenever possible. Three animals originally 
assigned to groups were discarded and replaced. 
Two were eliminated because of experimenter 
error and the other because it displayed an initial 
preference for the positive stimulus (9 out of the 
first 10 trials). 

Training procedure—The pretaining and 
training procedures have been previously de- 
scribed (1). The only departure from the 
previous procedure was that in the black-white 
discrimination training, there were 10 trials per 
day for only the first 40 trials instead of 200, and, 
after that, there were 20 trials per day. The 
criterion of learning was 18 out of 20 correct 
responses with the last 10 perfect in two suc- 
cessive blocks of 10 trials. 


RESULTs AND DiscussIoN 


The mean and median number of 
trials to criterion and number of total 
errors are presented in Table 1. The 
Mann-Whitney U test of significance 


‘ 


was applied to the differences between 
the groups and the results of these 
tests are presented in Table 2. The 
group learning curves based on per- 
centage correct for blocks of 20 trials 
are presented in Fig. 1. In order to 
plot these curves, Ss were assumed to 
run 100% correct after the criterion 
runs. 

The most obvious finding is that 
all three of the groups involving dif- 
ferential end-box stimulation are 
clearly superior to the GG control 
group. There was no overlap in 
either trials or errors between this 
group and any of the three experi- 
mental groups. Differences among 
the three experimental groups are 
small. The White-Black and Gray- 
Black groups are approximately equal 
and both are somewhat superior to 
the White-Gray condition. 

The data clearly indicate the effective- 
ness of the black end box following the 


TABLE 2 


Resutts or Mann-Wuitney U Test or 
DirFERENCES BETWEEN Groups 














Errors Trials 

Groups Compared P = P er 

GGvs.WB  —<.0001 <.0001 
GG vs. WC <.0001 <.0001 
GG vs. GB <.0001 <.0001 
WB vs. GB 409 .224 
WB vs. WG 035 019 
WG vs. GB .045 145 
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SECONDARY REINFORCEMENT 


choice of the negative, black alley. In 
fact, this condition produced just as 
rapid learning as did the presence of end 
boxes resembling both of the stimulus 
alleys. This finding indicates that Grice’s 
analysis of the original experiment (1) 
was incomplete, and suggests that an 
important factor in such learning may 
be the generalization of extinction from 
the negative end box to the similar alley 
at the choice point. This would then 
tend to produce a more rapid elimination 
of errors than in the control group. To 
extend the original theoretical analysis 
of delayed reward learning, this findtng 
suggests that learning with delayed 
reward will be facilitated not only by 
conditions producing immediate second- 
ary reinforcement of the correct response, 
but also by conditions hastening ex- 
tinction of competing responses. 

While apparently incomplete, Grice’s 
original analysis in terms of secondary 
reinforcement is further supported by 
the present experiment. The clear-cut 
superiority of the White-Gray group over 
the control group indicates that the 
generalization of secondary reinforce- 
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ment to the positive choice-point stimu- 
lus does facilitate learning even in the 
absence of the generalized inhibitory 
effect of the black end box. The data, 
then, are consistent with the origina! 
hypothesis of Spence (7) and Grice (1) 
that delayed reward learning is produced 
by immediate secondary reinforcement. 

In view of the fact that generalized 
secondary reinforcement and generalized 
extinction are both effective in the 
present situation (as evidenced by the 
WG and GB groups), it may appear 
rather strange that these two effects did 
not clearly summate to produce a marked 
superiority of the WB group where 
presumably both are involved. The 
WB group was somewhat superior to the 
WG group but about the same as the 
GB group. Of course, the logic of sta- 
tistical reasoning does not allow us to 
conclude that no true superiority of the 
WB condition exists, but even so, this 
failure to obtain a difference is not as 
strange as it first appears. First, we 
may assume that the habits of ap- 
proaching both the black and white alleys 
attain considerable strength as the result 
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of the mnondifferential secondary rein- 
forcement from entry into the gray delay 
compartments. Thus, the added rein- 
forcement received from the use of the 
white goal box would add rather small 
increments to the habit under the as- 
sumption of a negatively accelerated 
habit growth curve. It is not un- 
reasonable, then, that the generalized 
inhibitory effect upon approach re- 
sponses to the black should produce a 
more rapid elimination of errors. This 
is indicated by the superiority of the GB 
group over the WG. Since the extinc- 
tion effect occurs in fewer trials than the 
reinforcement effect, we are led to believe 
that the WB and GB groups learned es- 
sentially in the same way, namely, by 
the extinction of responses to the nega- 
tive stimulus. A more complete analysis 
of the situation would indicate that, even 
in view of these considerations, there 
should be some superiority of the com- 
bined effect in the WB group. However, 
its superiority would be small and per- 
haps clearly demonstrable only in a very 
large-scale experiment. 


SUMMARY 


Four groups of rats were run in a white-black 
visual discrimination learning situation with 2- 
sec. delay of reward. In one group, the end 
boxes were the same in color as the corresponding 
stimulus alleys. For a second group only the 
goal box was the same as the positive choice- 
point stimulus. In the third group, the empty 
end box was the same as the negative stimulus. 
For a control group neither end box was like the 
choice-point stimuli. 

In terms of trials and errors to a criterion, 
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all three experimental groups were significantly 
superior to the control group. There were only 
small differences among the three experimental 
groups, but the group with only the goal box 
similar to the positive stimulus was somewhat 
inferior to the other two. 

The results substantiate an earlier hypothesis 
explaining delayed reward discrimination learn- 
ing in terms of generalization of secondary rein- 
forcement to the choice-point cue. However, 
this interpretation was found to be incomplete, 
and the experiment suggests that in this situation 
an even more effective variable is the generali- 
zation of extinction to the negative choice-point 
cue. 
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This paper presents a very simple 
mathematical model for predicting 
choices among bets and an experiment 
in which that model is tested. The 
model is based on the concepts of 
utility and subjective probability and 
on the theory of games. 


DEVELOPMENT OF THE MoDEL 


Four models for decision-making.—This 
paper will discuss four similar simple 
models which have been used to make 
predictions of choices among bets. They 
are defined by the following four equa- 


tions: 
EV = > pbs (1) 
EU = > pins (2) 
SEM = > P*$; (3) 
SEU = E P* :0; (4) 


Equation 1 defines the concept of 
expected value (EV). The symbol 9p; 
stands for the probability of the ith 
outcome of the bet, the symbol $; stands 
for the value of the ith outcome of the 
bet in dollars, and > p= 1. The EV 


7 
of a bet is the amount which a gambler 
will receive as a result of making it, on 


1 This research was done while the author was 
at The Johns Hopkins University. It was done 
under Contract N5-ori-166, Task Order 1, be- 
tween the Office of Naval Research and The 
Johns Hopkins University. This is Report No. 
166-I-191, Project Designation No. NR-145-089, 
under that contract. I ara indebted to Miss 
Frances Brown, who ran Ss and processed much 
of the data, and to Miss Susanne Wollenberg, 
who also helped with the data processing. A 
very preliminary report of this experiment, in- 
cluding only the data from Ss 3 and 5, was 
presented to the Dunlap Symposium on Mathe- 
matical Models in the Social Sciences (8). 


the average. Consequently, the tradi- 
tional normative theory about choices 
among bets is that gamblers should 
choose the bet with the highest EV or 
lowest negative EV. (A _ normative 
theory is a theory about what people 
should do, rather than what they do do.) 
It is, of course, apparent that people 
don’t do this, and it is even doubtful in 
some cases that they should. Conse- 
quently, Bernoulli (1) suggested instead 
that gamblers should choose the bet with 
the highest expected utility (EU), where 
EU is defined as in Equation 2. The 
symbol 4; in Equation 2 is the ufility, or 
subjective value, of the ith outcome of 
the bet. This proposal, then, simply is 
a suggestion that the utility or subjective 
value of a given amount of money be 
substituted for its objective value in the 
calculation of an expected value. 
Recently, von Neumann and Morgen- 
stern (13) have revived Bernoulli’s sug- 
gestion, and have used Equation 2 as a 
definition of utility. This assumes, of 
course, that EU maximization is a theory 
about what people do do, rather than a 
normative theory. Following von Neu- 
mann and Morgenstern’s suggestion, 
Mosteller and Nogee (10) assumed the 
validity of the EU maximization model 
and used it to measure the utility of 
small amounts of money experimentally. 
If it is reasonable to assume that sub- 
jective values of money should be sub- 
stituted for objective values in Equation 
1, it is equally reasonable to make the 
same assumption about probabilities. 
The notion of subjective probabilities 
has been discussed for some time; one 
of the most explicit uses of it (without 
use of the concept of utility) was made 
by Preston and Baratta (11). They 
implicitly suggested that Ss be assumed 
to make choices among bets so as to 
maximize what might be called sué- 
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jectively expected money value (SEM), 
which is defined by Equation 3. The 
symbol P*; in Equation 3 stands for the 
subjective probability corresponding to 
the objective probability of the ith al- 
ternative. No addition theorem applies 
to the P* values; however, the sum of 
the objective probabilities corresponding 
to the various P* values in Equation 3 
is 1. Having assumed the validity of 
the SEM maximization model, Preston 
and Baratta used it to calculate a sub- 
jective probability scale. Mosteller and 
Nogee also used Equation 3 to calculate 
subjective probabilities, but preferred 
the utility interpretation given by Equa- 
tion 2. Similarly, it would be possible 
to analyze Preston and Baratta’s data 
by Equation 2 instead of Equation 3. 
There is little to choose between these 
opposed interpretations of either of these 
two experiments. 

In a series of experiments Edwards (3, 
4, 5) has shown that Ss prefer to bet at 
some probabilities rather than others, 
that these preferences are essentially in- 
dependent of the amounts of money in- 
volved so long as they are not too large 
and so long as all bets compared with 
each other have the same EV, and that 
Ss prefer less favorable bets at their 
preferred probabilities to more favorable 
bets at disliked probabilities. These 
experiments show that there are two 
patterns of probability preferences, one 
for positive EV bets and another for 
negative EV bets. These results, of 
course, can be easily accounted for by 
means of the SEM model, but are incon- 
sistent with the EU model. 

All these considerations suggest a more 
comprehensive model, which includes 
both utility and subjective probability. 
This paper is intended to present and 
test such a model. The model asserts 
that people choose among bets so as to 
maximize subjectively expected utility 
(SEU), which is defined in Equation 4. 
The symbols in Equation 4 have the same 
meaning as before. This model, like the 
others, does not assert that Ss know the 
SEU values of various bets and de- 
liberately choose the bet with the largest 
SEU; it asserts only that the hypothesis 
of SEU maximization will enable an E 
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to predict the choices that Ss will actually 
make. 

This model is similar to one proposed 
by Coombs and Beardslee (2). The 
most important difference is that Coombs 
and Beardslee assume both P* and u to 
be measurable only on an ordered metric, 
while this model (and the EU and SEM 
models also) assume them to be meas- 
urable on an interval scale. 

For, a fuller discussion of the concepts 
of utility and subjective probability and 
of the Mosteller and Nogee, Préston and 
Baratta, and other related experiments, 
see (6). 


Tue Test oF THE MopeE. 


In order to test the SEU maxi- 
mization model, three things are 
necessary. Scales of utility and of 
subjective probability must be de- 
termined for Ss whose behavior is to 
be predicted, and Ss must make 
choices among bets, so that these 
choices can be compared with those 
predicted by the model. 

The crux of the problem of testing 
this model is to devise methods of 
measuring utility and subjective prob- 
ability. Classical psychophysical 
methods seem inappropriate; both 
probabilities and amounts of money 
come stated in numerical form, and 
psychophysical judgments about them 
are almost certain to be made in 
terms of their numerical rather than 
their subjective values. 


The matrix method—The theoretically 
ideal method of measuring utility and 
subjective probability simultaneously in 
a gambling situation has been suggested 
by Woodbury (14). The essence of it 
is that a rectangular matrix with amounts 
of money on one margin and probabilities 
on the other defines a series of proba- 
bility-amount combinations, or bets. An 
S can be required to inform E of amounts 
of money such that he is indifferent be- 
tween them and the bets in the matrix. 
The SEU maximization model permits a 
system of simultaneous equations based 
on these judgments to be set up, enough 
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more equations can be added by linear 
interpolation between adjacent utility 
values to make the system soluble, and 
the solution gives the utility and sub- 
jective probability functions needed. 

This procedure was tried in this ex- 
periment, using the method of adjust- 
ment. The data were so variable as to 
be meaningless, and so will not be further 
discussed. Less variable data could 
probably be gotten by the method of 
constant stimuli, but it is unlikely that 
they would be good enough to meet the 
exacting needs of the matrix method. 

The N-bets method of utility measure- 
ment.—If no adequate technique for 
measuring utility and subjective proba- 
bility simultaneously can be devised, 
then it is necessary to measure them 
separately. The methods used by Mos- 
teller and Nogee (10) and by Preston and 
Baratta (11) are inappropriate, since 
they irretrievably confound utility and 
subjective probability. Some technique 
must be invented which either eliminates 
one of these variables or else holds it 
constant. 

The method used in this experiment, 
which will hereafter be called the N-bets 
method, is based on a sweeping, con- 
venient assumption. That assumption 
is that any bet can be considered as an 
object which has a utility (which can be 
calculated from Equation 4, once the 
utility and subjective probability func- 
tions are known), and that the utility of 
N identical bets is N times the utility 
of one such bet. This is an assumption 
that what the economists call the mar- 
ginal utility function for bets, but not 
necessarily for any other object, is a 
constant. (The marginal utility of a 
commodity is the first derivative of its 
total utility curve.) 

How can this assumption, be used to 
measure utility? Suppose that § is in- 
different between one 50-50 bet in which 
he wins $A if he wins and nothing if he 
loses, and N 50-50 bets in each of which 
he wins $0.10 if he wins and nothing if 


he loses. The assumption discussed 
‘above makes it legitimate to write: 


P*o.5 u $A + Pos u $0.00> 
N (P*o.5 4 $0.10 + P*o.s u go.o0) (5) 


Since both utility and subjective 
probability are defined as measurable on 
an interval scale, we may arbitrarily 
assign two points on each curve. Here 
and throughout this paper the following 
arbitrary definitions will be used: u ¢o.00 
= 0; 4 440.10 = 10; u —g0.10 = —10; P*o 
= 0; P*,=1. In cases where $0.15 
small bets are used, uw 4g0.15 = 15 and 
ee | dine 15. 

If these definitions and some simple 
algebra are applied to Equation 5, the 
terms involving 4 go.00 drop out, the 
P*,.5 factors out, and Equation 5 simpli- 
fies to: 

4a = 10 N (6) 


Since N is known, the utility of $A can 
be calculated. The same procedure can 
be carried out for other amounts of 
money, and so the whole utility curve 
can be determined. It should be noted 
that this utility curve, like those deter- 
mined by Mosteller and Nogee, is not a 
total utility curve of the type which is 
usual in economics. Its zero point is 
where S is at the moment he makes his 
choice. Since S§ wins and loses money 
during the course of each session, that 
zero point is not a fixed point on his total 
utility curve in the classical sense. Con- 
sequently, the kind of utility curve which 
this procedure produces is closely analo- 
gous to the kind suggested by Markowitz 
(9), except that Markowitz takes zero 
to be S’s usual location, while this pro- 
cedure takes it to be his current location. 

The N-bets method offers two ad- 
vantages. First, it is straightforward 
and the experiment it suggests is easy 
to perform. Secondly, it offers a built-in 
check on some of the assumptions in- 
volved. There is no reason why 50-50 
bets need be used, and there is no reason 
why the small bets should involve $0.10. 
Other probabilities and other sizes of 
small bets could be used. All proba- 
bilities and all sizes of small bets should 
lead to identical utility curves. If they 
do not, then it probably means that the 
N-bets assumption is incorrect, and the 
marginal utility of bets is not constant. 
For any given marginal utility function 
for bets, it is easy to predict the exact 
nature of the difference between utility 
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functions obtained from different sizes 
of little bets. 

If Ss preferred to bet at some vari- 
ances rather than others, then the exist- 
ence of such preferences would invalidate 
the N-bets method of utility measure- 
ment, and also the EU, SEM, and SEU 
maximization models. However, Ed- 
wards (7) has shown that such prefer- 
ences, though they do exist, are of minute 
importance as compared with proba- 
bility preferences, and so can be neg- 
lected. 

How to measure subjective probabili- 
ties—Once a utility scale is available for 
a given S, it is very easy to measure 
subjective probabilities. Suppose that 
S is indifferent between a certain gain of 
$0.55 and a 3/8 chance of winning $2.50. 
The SEU maximization model permits 
us to write: 


P* s/s U $2.50 + P* sis u $0.00 
= P*,u $0.55 (7) 


This simplifies to: 
P* 3:3 = u $0.55/U $2.50 (8) 


Since the two utilities are known, the 
value of the subjective probability cor- 
responding to an objective probability of 
3/8 can be calculated. By determining 
other indifference points involving other 
probabilities, other subjective proba- 
bilities can be calculated. No assump- 
tions beyond those involved in the SEU 
maximization model are necessary. The 
same procedure, carried out with losses 
instead of gains, gives the subjective 
probability scale for losing bets. 

How to test the SEU maximization 
model.—Once individual utility scales 
and individual subjective probability 
scales are available, testing the SEU 
maximization model is accomplished by 
having the same Ss make choices among 
bets. The SEU of each bet for each S 
can be calculated, and the prediction is 
that he will always choose the bet with 
the higher SEU (or smaller negative 
SEU, if no positive SEU is available). 
Since utility and subjective probability 
are measured only on interval scales, 
they have no true zero points. Con- 
sequently, the SEU model cannot be 
used to predict choices among bets which 


include both possibilities of winning and 
possibilities of losing. So simple bets, 
in which only possibilities of winning or 
possibilities of losing appear, must be 
used to test the SEU model. 


METHOD 


Apparatus.—All gambling in this experiment 
was done on two pieces of gambling apparatus. 
One was a pinball machine, which has been more 
fully described elsewhere (3). It has eight cells 
along its bottom, into any one of which the ball 
may roll. The Ss are told that the ball is just 
as likely to roll into any one cell as into any other. 
Under the surface of the machine eighteen elec- 
tromagnets are concealed, which completely 
control which cell the ball ends up in. These 
electromagnets are operated by a programmer 
in the next room, and the program is set up so 
that the instructions to Ss are correct. 

The other apparatus was designed because it 
seemed desirable to have some apparatus which 
could run off the strings of identical bets neces- 
sary for the N-bets method of utility measure- 
ment considerably more quickly than they could 
be run off on the pinball machine. Conse- 
quently, a bank of eight counters was set up. 
The counters were numbered from 1 through 8, 
and were automatically resettable. In front 
of them was a button. When S pushed it, one 
of the eight counters received one electrical im- 
pulse. A programmer determined which counter 
would be operated. By holding the button 
down, S could operate the apparatus con- 
tinuously, so that a succession of impulses 
operated various counters. An auxiliary counter 
counted the total number of impulses entered 
into the bank of eight counters since the last 
resetting. Thus, in order to run off 20 identical 
bets, all S had to do was to hold down the button 
until the auxiliary counter read 20, and read off 
from the eight main counters the number of times 
those counters which paid off for him had been 
operated. The whole process takes about 20 to 
30 sec. This procedure is simply a faster equi- 
valent of playing the pinball machine 20 separate 
times, each time making an identical bet. 

Subjects —The Ss for this experiment were 
five Johns Hopkins male undergraduates. They 
were naive about the purposes of this experiment, 
had not participated in gambling experiments 
before, and for the most part were relatively 
unfamiliar with gambling. The names of those 
who were invited to be Ss were selected at 
random, but it was necessary to issue 23 — 
vitations in order to get five acceptances 
Therefore this cannot be regarded as a random 
sample of Johns Hopkins undergraduates. How- 
ever, so far as the available information goes it 
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is a random sample of those Johns Hopkins 
undergraduates who are willing to participate in 
prolonged gambling experiments. 

General procedure-—Each S was told that his 
purpose in the experiment was to make as much 
money as possible. He was also told that every 
game he would play in the experiment was 
mathematically “fair,” that is, that in the long 
run he should break about even. However, he 
was told that the situation would be biased in 
his favor by $1.00 per session. This was accom- 
plished by giving him $21 in chips at the be- 
ginning of each session and taking back only 
$20 in chips or cash at the end of each session. 
Each S knew that he was expected to pay any 
losses out of his own pocket, and that he would 
be paid his winnings in cash. However, when 
an S lost more money than he had available 
(one S lost $120 in one session), he was allowed 
to pay with an JOU until next time. 

After all sessions were finished, each S was 
run for an extra session, in which the program of 
the gambling apparatus was prepared so as to 
cause him to win or lose enough so that he won 
an average of $1 per hour for the entire experi- 
ment. Each S spent a total of about 50’hr. in 
the experiment. 

On all sessions in which the eight-counter 
apparatus was used, S was permitted to select 
which counters would be “his” and which would 
not. The counters which he chose as his would 
pay off in his favor; the others would be against 
him. He could not change selections in mid- 
session. This arrangement was necessary be- 
cause it was obvious that the eight-counter 
apparatus was programmed, and so it was 
necessary to make it clear that the program was 
not prepared unfairly. On the pinball machine, 
whose electrical connections and mode of oper- 
ation were not visible, it was not necessary to 
do this. 

Utility measurement sessions ——The first 12 
sessions were devoted to the N-bets method of 
utility measurement. In Sessions 1-4, the 
probability of winning was 4/8 and the amount 
involved in each little bet was $0.10. In Ses- 
sions 5-8, the probability of winning was 4/8 
and the amount involved in each little bet was 
$0.15. In Sessions 9-12, the probability of 
winning was 2/8 and the amount involved in 
each little bet was $0.10. The amounts of 
money whose utilities were measured in each 
session were plus $5.50, $4.50, $3.50, $2.50, 
$2.00, $1.50, $1.00, $0.90, $0.80 $0.70, $0.60, 
and $0.50; and minus those same amounts. For 
each amount of money, seven cards were pre- 
pared, each with a different value of N. A 
sample card read: 

A. You have four numbers. If you roll any 

one of your numbers, you win $5.50. 
If you do not, you win nothing. 


B. You have four numbers. You make 30 
identical bets. On each bet, if you roll 
one of your numbers, you win $0.10. If 
you do not, you win nothing. 

All other cards were similar, except that they 
included different amounts which could be won 
in the big bet and different numbers of small 
bets. To measure negative utilities, the verb 
was changed from “win” to “lose.” In Sessions 
5-8, each alternative began “You have two 
numbers.” In Sessions 9-12, each small bet 
paid off $0.15 instead of $0.10. 

In addition to the cards already discussed, 
other cards were prepared which were identical 
except that the value of N was left blank. This 
was necessary because it was possible that the 
indifference point for a given S might not fall, 
on one day, within the range of values of N 
which had been selected. Whenever that hap- 
pened, E wrote in values of N which seemed 
more likely to be near the indifference point, and 
slipped the new cards into the pack. The Ss 
were, of course, aware that E added cards to the 
pack from time to time, and they eventually 
learned that E kept on doing so until S had 
changed over from 4-type alternatives to B-type 
alternatives, or vice versa. This knowledge 
would have made it possible for an S who was 
skilled at gambling to adopt the strategy of 
changing quickly on losing bets (thus maxi- 
mizing the number of winning extra cards). To 
see whether anyone did this, counts were made 
of the number of extra cards-used on winning and 
losing bets at various stages of the experiment. 
No indication could be found in these counts 
that any S used such a strategy. In fact, most 
Ss tended to resist changing more stubbornly on 
losing bets than on winning ones. 

When the approximate indifference point had 
been located in the first session, thereafter only 
cards reasonably near it were used in subsequent 
sessions. This reduced the total number of 
cards, and so the length. of the sessions. It is 
probable that the selection of the range of cards 
presented to S had at least a minor effect on his 
indifference point. This effect could not be 
avoided, since some range had to be selected; it 
could only be hoped that the effect would be 
minor compared with the utility and subjective 
probability differences being studied. The ef- 
fectiveness of the utility and subjective proba- 
bility scales determined by this method in 
predicting choices among bets suggests that this 
hope was fulfilled. 

Subjective probability measurement sessions.— 
Sessions 13-18 were subjective probability meas- 
urement sessions. The probabilities whose 
subjective probabilities were measured were 1/8, 
2/8, 3/8, 4/8, 5/8, 6/8, and 7/8. Each sub- 
jective probability was determined twice in each 
session, once for positive EV bets and once for 
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negative EV bets. For each probability, cards 
offering that probability of winning various 
amounts of money compared with the certainty 
of winning $0.55 were prepared. A sample card 
read: 


A. You have three numbers. If you roll 
any one of your numbers, you win $1.50. 
If you do not, you win nothing. 

B. You receive $0.55 for sure. 


Other cards used in determining the subjective 
probability of 3/8 were identical except for the 
amount of money which could be won in the 4- 
alternative. Similar sets of cards were prepared 
for each other probability. As in the utility 
measurement sessions, extra cards with the 
amount of money in the 4-alternative left blank 
were available, in case the indifference point fell 
outside the anticipated range. 

Model-testing sessions—Sessions 19-22 were 
the model-testing sessions. For the purpose of 
testing the model, the bets which have been 
extensively used in previous probability prefer- 
ence experiments were used again. Four lists 
of bets were used, with EVs of +$0.75, +$0.524 
and minus those same amounts. Table 1 gives 
those lists of bets. Each bet was paired with 
all others at its EV according to Ross’s optimal 
order for pairs in the method of paired com- 
parisons (12). The Ss were required to choose 


TABLE 1 


Bets Usep 1n THE MopeEL-TEsTING 
SEssIONS 








Positive Expected Value Bets 

















Amount Paid 
Probability | Payoff Cells 
$0.75 EV | $0.524 EV 
1/8 + $6.00 $4.20 
2/8 1,7 3.00 2.10 
3/8 2, 4, 6 2.00 1.40 
4/8 2, 4, 7,8 1.50 1.05 
5/8 (& |] Hm 1.20 0.84 
6/8 All but 3 & 6 1.00 0.70 
7/8 All but 5 0.86 0.60 
8/8 All 0.75 0.53 








Negative Expected Value Bets 


All negative expected value bets are identical 
with the corresponding positive expected value 
bets except that S loses the amount stated if the 
ball rolls into one of the payoff cells listed, in- 
stead of winning it. 





Note.—The exact wording of these bets can be 
found in (5), p. 88. 
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one member of each pair. These bets were run 
off on the pinball machine. The Ss made each 
choice four times, once in each session. 


RESULTS 


Utility measurement sessions.—Re- 
gression lines were fitted to the results 
of each block of four utility measure- 
ment sessions for each S by the 
method of least squares, or a simpler 
approximation when little error re- 
sulted. The point at which S would 
presumably choose each alternative 
50% of the time (the indifference 
point) was read from these lines. In 
three cases (out of 120) in which this 
procedure gave an obviously un- 
reasonable 50% point, the actually 
observed 50% point was used instead. 

To determine the utility functions, 
Equation 6 was solved for each 
amount of money used. The result 
of these calculations is three utility 
curves for each S. These curves are 
shown in Fig. 1. It is apparent that 
there are differences from one curve 
to another, but these differences do 
not seem to be systematic; they can 
probably be attributed to day-to-day 
variations in judgments. There is no 
evidence that the curve for Sessions 
5-8, in which the $0.15 small bets 
were used, is consistently smaller for 
the high values of money than the 
other two curves, as it would be if 
the utility of N bets was less than NV 
times the utility of one bet; or con- 
sistently larger, as it would be if the 
utility of N bets was greater than N 
times the utility of one bet. So it is 
reasonable to assume that this experi- 
ment remained within the sizes of N 
for which the N-bets assumption is 
acceptable. Since there does not 
seem to be any systematic difference 
between the different utility curves, 
the three utility values determined for 
each money value were averaged, and 
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Fic. 1. Experimentally determined individual utility curves. The 45° line in each graph is the 
curve which would be obtained if the subjective value of money were equal to its objective value. 
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Fic. 2. 
PEP are for winning bets. 


Experimentally determirted subjective probability functions. 


The functions labeled NEV are for losing bets. 


The functions labeled 


The 45° line is the 


function which would be obtained if subjective probability were equal to objective probability. 
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that average is used in all subsequent 
calculations. 

The most outstanding fact about 
Fig. 1 is the relatively high degree of 
linearity of the utility curves. Most 
of them adhere fairly closely to the 
45° line which represents the equation 
ug, = $A (where $A is measured in 
pennies). For those cases which de- 
viate substantially from the 45° line, 
the deviation seems to be greater for 
the negative branch of the ‘utility 
curve than for the positive branch, 
It should not be assumed that the 
utility of money is equal to its ob- 
jective value for all possible values 
even for those Ss who adhere to the 
45° line most closely ; instead, it seems 
likely that for large values of $A ($100 
and more, for instance), the value of 
usa will be quite different from $A. 
However, most college students have 
had extensive experience with re- 
ceiving and giving up amounts of 
money such as are used in this ex- 
periment. Their subjective impres- 
sions of the values of such sums of 
money are likely to be very close to 
the true values because of this long 
learning process. This generalization, 
if it is true enough often enough, offers 
a great simplification for future gam- 
bling experiments. If it is possible 
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to use the physical value of money 
instead of its utility in the SEU 
maximization model, then application 
of the model will be much easier. 
Subjective probability measurement 
sessions.—Graphs similar to those 
used in determining 50% points for 
the utility measurement sessions were 
used for the subjective probability 
measurement sessions, except that the 
X-axis contained amount of money 
rather than value of N. Mid-points 
were determined by the same methods 
used for the utility measurement 
sessions. Then equations like Equa- 
tion 8 were solved for each positive 
and negative subjéctive probability. 


‘Graphs of the resulting subjective 


probability functions are shown in 
Fig. 2. The most outstanding fact 
about Fig. 2 is that in general Ss tend 
to overestimate their chances of win- 
ning money, while retaining a fairly 
objective estimate of their chances of 
losing it. This result agrees pretty 
well with casual observation and com- 
mon sense. Notice that one subjec- 
tive probability function goes slightly 
above 1. This finding, though sur- 
prising, is permitted by the SEU and 
SEM models. 

+The test of the SEU maximization 
hypothesis.—The choices made during 


TABLE 2 


NuMBER oF Correct AND Incorrect Prepictions Basep on THE SEU 
MaximizaTion Hyporuesis 








Subject 





Mean 





Right | Wrong | Right | Wrong | Right 


Wrong | Right | Wrong | Right | Wrong | Right | Wrong 





+$0.52-1/2 | 100 12 71 41 99 





+$0.75 81 31 73 39 | 100 
—$0.52-1/2| 78 | 34 71 41 61 
— $0.75 64 | 48 87 25 45 

















13 107 





5 72 40 | 89.8 | 22.2 
12 105 7 93 19 | 90.4 | 21.6 
51 95 17 66 46 | 74.2 | 37.8 
67 91 21 70 42 | 71.4 | 40.6 














_ _Note.—By chance 56 correct predictions should be expected in each group. 
significantly different from 56 at the .05 level; 70 at the .01 level; and 74 at the .001 level. 


By chi square, 67 predictions is 
Accordingly, all pre- 


dictions are significantly better than chance except those for Ss 1 and 3 on the — $0.75 EV level and for Ss 3 and 


5 at the — $0.52-1/2 EV level. 
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the model-testing sessions, Sessions 
19-22, are not interesting in them- 
selves. They were very similar to 
the usual results of probability prefer- 
ence experiments (3, 5). The in- 
teresting thing about these choices is 
whether or not they correspond to the 
predictions made by the SEU model. 
Table 2 shows the number of correct 
and incorrect predictions for each of 
the four EV levels foreach S. It is at 
once apparent that the model predicts 
significantly better than chance (i.e., 
predictions from the model are better 
than those that could be made by 
flipping a coin). It is also apparent 
that the predictions from the model 
include a number of errors. These 
errors are of two kinds. First, there 
are cases in which Ss chose one way 
on a given pair in some of Sessions 
19-22 and the other way on other 
sessions. Since the model predicts 
100% choice of the alternative with 
the higher SEU, any such day-to-day 
variation in choices results in errone- 
ous predictions. Such errors do little 
damage to the model. The other and 
more serious kind of error occurs when 
S chooses consistently in all four 
model-testing sessions, but the SEU 
model predicts the opposite of the 
choices which actually occur. On the 
average, each S made 18.8 errors of 
the first kind and only 10.6 errors of 
the second kind at each EV level. 
(Note that this means that only an 
average of 2.65 pairs per S per EV 
level were completely wrongly pre- 
dicted; each completely wrong pre- 
diction produces four errors.) An 
analysis of variance of the errors shows 
that this difference is very highly sig- 
nificant. Variance due to Ss, of 
course, is also significant. And the 
difference between errors on positive 
EV and negative EV bets is signifi- 
cant. This last variable interacts 
with Ss. The nature of the’ effect, 
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as can be seen from inspecting Table 
2, is that errors are in general more 
frequent on negative EV than on 
positive EV bets, but this is not true 
for some Ss. It is interesting that 
neither Ss nor positive EV bets vs. 
negative EV bets interacts signifi- 
cantly with consistent vs. incon- 
sistent errors. 

The next task is to account for the 
wrong predictions. Some of them, 
as we have seen, result from incon- 
sistent choices in the model-testing 
sessions. Others may perhaps be 
accounted for by the size of the SEU 
differences involved. An obvious hy- 
pothesis is that Ss are more likely to 
choose consistently with prediction 
when the two bets between which they 
are choosing are far apart from one 
another in SEU. This hypothesis is 
strengthened by the fact that the 
median SEU difference between pairs 
of bets is much smaller for the nega- 
tive EV bets for all Ss than it is for 
the positive EV bets. This is con- 
sistent with the fact that, in general, 
prediction is less satisfactory on the 
negative EV bets. In order to test 
this hypothesis more precisely, prod- 
uct-moment correlation coefficients 
were calculated between the size of 
the SEU difference between each pair 
of bets and the number of choices 
consistent with prediction on that 
pair. These correlations are small 
(median value: + .23), but all of 
them are positive. The reason they 
are small is that the number of 
choices consistent with prediction can 
have only five values, 0, 1, 2, 3, and 
4. When a variable which can assume 
only a few values is correlated with 
some other variable, the correlation 
coefficient is inevitably low. How- 
ever, a simple sign test shows that the 
probability of obtaining 20 correlation 
coefficients with the same sign, if the 
true correlation is 0, is 2 (1/2)*— 
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TABLE 3 
NumBer or Correct AND Incorrect Prepictions Basep on THE EU 
MaAxIMIZATION Hyporuesis 
Subject 
Expected Mean 
Value 1 2 3 4 5 

Level | eas 
Right | Wrong | Right | Wrong Right [Wrong Right | Wrong | Right | Wrong | Right | Wrong 

+$0.52-1/2| 12 | 100 | 73 | 39 | 41 | 71 | 37] 75 | 20 | 92 | 366] 75.4 
+$0.75 19 93 59 53 40 72 25 87 41 71 36.8 | 75.2 
—$0.52-1/2 | 38 74 57 55 39 73 103 9 66 46 | 60.6 | 51.4 
— $0.75 24 | 88 63 49 47 65 99 13 82 30 63.0 | 49.0 











very small indeed. This confirms the 
hypothesis that there is a positive 
relationship between size of SEU 
difference and number of choices 
consistent with prediction. 

A test of the EU maximization hy- 
pothesis.—The next question to ask 
is: How well can the utilities alone 
and the subjective probabilities alone 
predict choices? 

To test the predictive value of the 
utilities considered alone is essentially 
equivalent to testing the EU maximi- 
zation model proposed by Bernoulli 
and von Neumann and Morgenstern. 
In order to do it, all that is needed is 
to calculate the EU for each S for 
each bet, using Equation 2. This was 
done. The resulting predictions were 
compared with the choices made in 
the model-testing sessions. The re- 
sults of this comparison are shown in 
Table 3. It is at once evident that 
the EU maximization hypothesis does 
not predict better than chance. In- 
deed, if an analysis of variance of the 
error scores is made and the error 
estimate from that analysis is used to 
estimate the standard error of the 
mean, then the EU maximization 
hypothesis predicts significantly (P 
= .05) worse than chance on the 
positive EV bets. (The results of 
that analysis of variance, incidentally, 
show that Ss, the difference between 


positive EV and negative EV bets, 
and their interaction significantly 
affect the number of errors.) None 
of the various other comparisons 
which were reported for the SEU 
model offers any evidence, when made 
on the EU model, that the EU model 
is any good. So far as the evidence 
of this experiment goes, it is necessary 
to conclude that the EU maximization 
model fails to predict simple gambling 
decisions. 

The reason for the failure of the EU 
maximization model is, presumably, 
the near-linearity of the utility scales 
used. Ifthe utility scales are actually 
relatively linear, then any predictions 
of choices between constant-EV bets 
must be made on the basis of minor 
irregularities in the curves, and it is 
not surprising that such predictions 
fail to work out. The only successful 
predictions based on EU values, S 
4’s NEV bets, are consistent with this 
argument. This S’s negative utility 
curve deviates substantially from the 
uga = $A line, and his negative sub- 
jective probability curve deviates very 
little from the P* = » line. 

The relative linearity of the utility 
curves, if it is a general phenomenon, 
means that the EU maximization 
model will never, as a practical matter, 
be very useful in gambling experi- 
ments. This does not, of course, 
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preclude the possibility that it might 
be useful in other situations, par- 
ticularly those which use larger sums 
of money. However, the evidence 
does strongly suggest that whenever 
the EU maximization model is useful, 
the SEU maximization model will be 
more useful. 

A test of the subjectively expected 
money (SEM) maximization model.— 
We have examined the SEU and EU 
maximization models. It remains to 
examine the SEM _ maximization 
model, in which prediction of choices 
is affected only by subjective proba- 
bilities. In the original calculation 
of the subjective probability scales 
given in Fig. 2, utility values were 
used. If the predictive value of the 
SEM maximization model is to be 
tested, it is not permissible to use 
utilities in the original calculation of 
the subjective probabilities which go 
into the model. Consequently, all 
subjective probabilities were recalcu- 
lated using money values instead of 
utility values in equations like Equa- 
tion 8. The resulting subjective 
probability functions differ only in 
minor ways from those given in Fig. 
2. 

For each bet used in the model- 
testing sessions, an SEM value was 
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calculated from the new subjective 
probability functions. Then the pre- 
dictions based on the hypothesis of 
SEM maximization were compared 
with the choices in the model-testing 
session. Table 4 shows the results. 
It is at once apparent that the SEM 
maximization model predicts quite 
well. Comparison of Table 4 with 
Table 2 shows that the SEM maximi- 
zation model predicts nongignifi- 
cantly better than the SEU maximi- 
zation model. Analysis of variance 
of the SEM model errors gives es- 
sentially the same picture as that for 
the SEU model. The same generali- 
zations about errors and about the 
superiority of positive EV predictions 
over negative EV predictions which 
were made for the SEU model apply 
to the SEM model also. Indeed, the 
only respect in which the SEM model 
is inferior to the SEU model is that 
when the correlations between SEM 
difference and number of choices 
consistent with prediction are calcu- 
lated, only 17 instead of 20 of the 
correlation coefficients come out posi- 
tive. The probability of getting so 
many correlation coefficients of the 
same sign if the true correlation is 0 
is .0026. 

It must be concluded that the SEM 


TABLE 4 


NuMBER oF Correct AND Incorrect Prepictions Basep on THE SEM 
MaximizaTion Hyporuesis 











Subject 
Expected —_—— Mean 
Value 1 2 3 4 5 
Level 





Right | Wrong | Right | Wrong | Right 


Wrong | Right | Wrong | Right | Wrong | Right | Wrong 





+$0.52-1/2 | 100 12 71 41 99 


+$0.75 93 19 73 39 | 100 
—$0.52-1/2| 78 | 34 69 43 61 
— $0.75 76 | 36 75 37 45 




















13 107 5 72 40 | 93.4) 18.6 
12 105 7 93 19 | 92.4 | 19.6 
51 93 19 66 46 | 74.2 | 37.8 
67 91 21 70 42 | 72.2 | 39.8 























_ _Note.—By chance 56 correct predictions should be expected in each group. By chi square, 67 predictions is 
significantly different from 56 at the 0.05 level; 70 at the 0.01 level; and 74 at the 0.001 level. Accordingly, all 
es are significantly better than chance except for S 3 at both negative EV levels and S 5 at the — $0.52-1/2 
evel. 
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model predicts choices just as well as 
the SEU model does. Indeed, ex- 
amination of the predictions shows 
that the SEM model and the SEU 
model usually predict the same or 
almost the same rank order for the 
bets. The reason why is that the 
SEM values correlate highly with the 
SEU values. The median correlation 
is .96; the lowest is .82. 


Discussion 


If the results of this experiment are™ 


taken literally, they mean the following 
things. First, over the range of money 
used in this experiment (+ $5.50 to 
— $5.50) the utility of money is fairly 
linearly related to its dollar value for 
most Ss. Secondly, the SEU maximi- 
zation model predicts choices among 
simple bets rather well. Thirdly, there 
is a positive (though not perfect) cor- 
relation between the size of the SEU 
difference between two bets and the 
probability that the S will choose the 
one with the larger SEU. Fourthly, the 
EU maximization model is no good at 
predicting choices among simple bets 
like those used in this experiment. This 
is presumably because utility is fairly 
linear with money value over the ranges 
used in this experiment, and so the sizes 
of the EU differences among the bets 
used in the model-testing sessions of this 
experiment are very small. Finally, a 
model which asserts that Ss choose so 
as to maximize SEM predicts just as well 
as the SEU maximization model, and 
for the most part makes the same pre- 
dictions. This fact can be explained, of 
course, by the fact that the utility curves 
are approximately linear, so that SEM 
and SEU values for any bet are very 
highly correlated with one another. 
Most of these conclusions depend for 
their validity on the assumption that the 
N-bets method is a satisfactory method 
of utility measurement. The internal 
consistencies in the results which indicate 
its acceptability have already been dis- 
cussed. 

The results of this experiment are 
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entirely consistent with the results of 
the various probability preference ex- 
periments. They suggest that the most 
important determiner of choices among 
bets is subjective probabilities, and that 
bets with the same subjective probability 
ought to be preferred to about the same 
extent when compared with others at the 
same EV level. That is just what 
happened in one probability preference 
experiment (5). 

Generalizations from this experiment 
should be made cautiously. First, nei- 
ther the SEU model nor this experiment 
offers any way of predicting choices 
among bets which include possibilities 
both of winning and of losing. The 
SEU values of the winning possibilities 
and of the losing possibilities can be 
algebraically added to get a composite 
SEU, but whether such composite SEU 
values will predict choices is an experi- 
mental problem. Secondly, the SEU 
model has no true zero point, and so 
does not predict whether a given bet 
will be accepted or refused, if the alterna- 
tive is simply not gambling. Finally, 
there is no guarantee that subjective 
probabilities are independent of the 
utilities with which they are associated, 
though the SEU model assumes that 
they are. With very large bets or very 
small bets, the form of the subjective 
probability function might be quite 
different from that found in an experi- 
ment like this one for a given S. All of 
these questions can be settled by ap- 
propriate experiments. 

More theoretical work needs to be done 
on the SEU model. Ideally, it should be 
a probabilistic model, predicting the 
probability of choice of the alternative 
with the higher SEU as a function of the 
size of the SEU difference. The obvious 
form for a theoretical curve relating 
probability of choice of alternative 4 
over alternative B to size of SEU, — 
SEUz is a normal ogive, with the 50% 
point at a SEU difference of 0. But 
other intuitively plausible curves can 
also be conceived. At any rate, until 
better methods of measuring utilities and 
subjective probabilities are found, there 
is little reason for going into such theo- 
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retical refinements, because there is no 
hope of testing them experimentally. 
The improvement of methods of utility 
and subjective probability measurement 
is still the most important task in the 
study of gambling behavior. 


SUMMARY 


A very simple mathematical model was de- 
veloped for predicting choices among bets. This 
model, based on the concepts of subjective value 
or utility of money and subjective probability, 
asserts that Ss choose the bet with the maximum 
subjectively expected utility. An experiment 
was designed to test this model. The utility 
of money was measured by assuming that the 
utility of N identical bets is NV times the utility 
of one such bet—an assumption which the results 
of the experiment verify. Subjective proba- 
bilities were measured by other choices. Finally, 
still other choices were used to test the model. 
The model predicted substantially better than 
chance. Another model, which assumes that Ss 
choose so as to maximize expected utility, failed 
to predict choices successfully. A third model, 
which uses only the concept of subjective proba- 
bility, predicted as well as the subjectively ex- 
pected utility maximization model. 

It is concluded that the subjectively expected 
utility maximization model is adequate to 
account for the results of this experiment, and 
that subjective probabilities are much more im- 
portant than utilities in determining choices 
among bets such as those used in this experiment. 
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PERFORMANCE AND ITS EFFECTS UPON 
PART-PART AND PART-WHOLE 
RELATIONS! 
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A previous article (1) discussed 
some of the possible contributions of 
component activities to the: total 
psychomotor task and suggested some 
possible means of examining the re- 
lations between components of a task 
and the total task. That article 
should be considered as the intro- 
duction to the experiments reported 
here. The previous paper dealt with 
the analysis of simultaneous and non- 
simultaneous occurrences of two or 
more responses in fairly complex 
psychomotor performance situations. 
Data were presented on two tasks 
which showed that means and vari- 
ances for criterion combinations of 
on-target scores could be fairly ac- 
curately predicted by assuming that 
the times of occurrence of scoring in 
one response dimension are _ inde- 
pendent of the times of scoring in a 
second dimension. This can be ex- 
pressed more precisely as follows, 


A, X B. = AB, = AB, (1) 


where A and B represent two different 
response dimensions, AB represents 
the combination score or the propor- 
tion of time on target in both dimen- 
sions simultaneously, and o and p 
identify proportions (probabilities) 
of time on target as obtained and 
predicted, respectively. 

! This research was carried out under the Air 
Force Personnel and Training Research Center, 
Lackland Air Force Base, San Antonio, Texas, 
in support of Project 7707. Permission is 
granted for reproduction, translation, publi- 
cation, use, and disposal in whole or in part by 
or for the United States Government. 
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The hypothesis of independence 
among component behaviors was 
tested with the Pedestal Sight Man- 
ipulation Test (2) and the Stevens 
Pursuitmeter (3). The _ results 
showed a slight tendency for a 
positive relation between two com- 
ponents and a zero relationship if a 
third component were added (ABC). 
At any rate, no consequential error 
(in a practical sense) was made in 
predicting AB proficiency whenever 
independence was assumed.? 

An implication of the chance re- 
lationship is that redistributions of 


? Unfortunately, statistical theory does not 
yet appear sufficiently advanced for adequate 
tests of significance. The product of two inde- 
perident variables, each of which is normally 
distributed, is not normally distributed. Further, 
the distribution of a product can be skewed. 
Since the sampling distribution of the product 
is unknown, it is hazardous to compare observed 
and predicted means or variances. 

Because of the difficulties over statistical 
comparisons of the data, the question of the 
desirability of collecting data at all is raised. 
This question is not easily laid to rest. The 
author believes that a contribution can be made 
without making a statistical evaluation of the 
significance of the difference between means. 
For example, if it can be said that 4B, — AB, 
= 2 percentage points or less for several psy- 
chomotor devices, and for extended periods of 
practice, the statement appears worthwhile 
without statistical evaluation. At least, an- 
other investigator could expect to get similar 
discrepancies under similar circumstances and 
could probably assume that Formula 1 will lead 
to values of 4B, which are within tolerable or 
practical ranges of 4B. More sophisticated 
evaluations of the data are, of course, desirable, 
and it is hoped that a treatment of the distri- 
bution of products will be made available. 
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part proficiencies will alter the mag- 
nitude of the combination score in 
specifiable ways. It was also stressed 
that the alteration of the combination 
score can perhaps be attained without 
changing S’s fundamental ability to 
perform the parts of the total task 
(when each part is practiced sepa- 
rately). As an example of this point, 
suppose an S practicing two parts 
simultaneously scores 4, = .90 and 
B, = .20. For this distribution of 
part proficiencies, a proportion of .18 
should be obtained as a criterion of 
proficiency at remaining on target in 
A and B simultaneously. Suppose 
that the same S, however, could score 
A, = 80 and B, = .30. In this case, 
the predicted AB score is .24 or 6 
points higher than with the previous 
distribution, but the sum of the indi- 
vidual probabilities, 4 and B, is not 
changed. If it can be experimentally 
demonstrated that S can alternate 
from one distribution to the other 
from trial to trial, then it might be 
allowed that S is exhibiting a constant 
skill level, yet systematically varying 
his criterion score. 

The previous paper outlined two 
methods by which part proficiencies 
might be alterable when all parts are 
simultaneously introduced, without 
changing S’s ability to perform each 
of the parts when performed as parts; 
both methods are used in the present 
study. In the first method, S’s at- 
tention to environmental stimuli (4 
and B display needles) is redistributed 
through manipulation of knowledge 
of results (KR); this is done by peri- 
odically informing S which of his part 
proficiencies is poorest. In the second 
method, S’s attention is redistributed 
in a manner procedurally different 
from the first; this is done by in- 
structing S to pay twice as much 
attention to one particular part of the 
task. . 

The study outlined above must 
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allow the individual S a relatively 
long period of practice because the 
task must be difficult and it must be 
assumed that S’s performance of the 
parts is stable. Thus, the experi- 
ment is necessarily lengthy. The 
study is broken down into two ex- 
periments. Basically, the first ex- 
periment represents but an extended 
training period for assuring stable and 
proficient performance. The ex- 
tended training given in ‘Exp. 1, 
however, also afforded opportunity 
for evaluating the effects of adding 
certain kinds of KR both early and 
late in training. In Exp. 2, the Ss 
were subjected to extreme conditions 
calculated to alter the usual modes of 
attending to a multidimensional task 
in order to examine part-part and 
part-whole relations. 


METHOD 


Apparatus —Two units of the SAM Multi- 
dimensional Pursuit Test (CM813E) were used. 
This apparatus is described by Melton as follows: 

. . . consists of an instrument panel and 
simulated airplane controls. During a test 
trial, the various pointers of the panel meters 
drift about in an irregular and continuously 
varying pattern. The subject’s task is to 
manipulate the controls so that the meter 
pointers are kept at specified points on their 
respective scales. The ‘stick’ controls the 
‘climb’-and ‘bank’ indicators, the rudder pedals 
control a ‘turn’ indicator, and the throttle 
governs the position of the needle on an ‘air- 
speed’ indicator. 

“Movement of the various needles on the 
instrument panel is governed by a motor which 
turns four cams, spaced at intervals on one 
shaft. Each cam controls the position of the 
sliding contact on a wire-wound potentiometer. 
Corresponding to each of the motor driven 
potentiometers there are four potentiometers 
which are connected to the various airplane 
controls. When one of the motor driven 
potentiometers shifts a needle from its specified 
point on the scale, a compensatory adjustment 
of the appropriate airplane control will bring the 
needle back to its origin” (3, pp. 7-18). 

The basic apparatus above was modified to 
present a simpler display panel. Indicator 


scales of simple and standard design were sub- 
stituted for the original scales. The modified 
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TABLE 1 


DispLay-ConTROL RELATIONSHIPS FOR THE 
MULTIDIMENSIONAL Pursuit Test 








contrat | Sestrot | indicator | Fadiator 
Stick (left 
hand) forward air speed (S) | right 


backward | air speed (S)| left 
Pedals (feet) 


right heading (H)| right 
left heading (H) | left 
Stick (right 
hand) forward | altitude (A) | right 
backward | altitude (A) f left 
Stick (right 
hand) right bank (B) right - 
left bank (B) left 














scales represented zero at the center and devia- 
tions from zero by markers spaced at uniform 
intervals to left and right of zero. The S’s 
task, then, can be described as attempting to 
zero four indicator needles in a compensatory 
manner by appropriate manipulations of four 
controls. The display-control relationships are 
outlined in Table 1. For simplicity, the four 
controls or components are called S, H, A, B. 
Procedure in Exp. 1 

The variable was the kind of KR administered 
to 72 naive airmen from Lackland AFB. Three 
different varieties of KR were evaluated during 
a 20-day period of practice. After 60-sec. trials 
of practice, the Ss of Treatment LC (lowest 
component) were told which of their four com- 
ponents ranked lowest; the Ss of Treatment RC 
(random component) were told “which of their 
four components ranked lowest,” but the com- 
ponent was actually selected by E on a random 
basis, and Ss of Treatment NC (no component) 
were given no extra KR by E. Treatment RC 
was included as a test of the effects of the mere 
identification of a component for S, as opposed to 
correct identification of the worst component. 
Treatment RC, then, was used as a sort of moti- 
vational control. Treatment NC represents a 
standard manner of administering a psychomotor 
test without giving KR beyond that which S 
can see for himself, and Treatment LC repre- 
sents a not uncommon manner of providing a 
certain amount of extra KR, although by no 
means the only way. 

By using different treatment sequences for 
four groups, it was possible to arrange for a test 
of the effects of KR when introduced im- 
mediately or relatively late within the practice 
session. The sequence of treatments for the 
four groups is presented in Table 2. 

Standard Electric timers measured the cumu- 


lative time that the controls were within the 
target extent definitions. Besides timers for 
each of the four components, three additional 
timers usually took measures of AB, HAB, and 
SHAB. AB, for example, represents the time 
the controls were correctly positioned within the 
limits of A and B simultaneously. The time- 
on-target scores were recorded after each 60-sec. 
practice trial and all time scores in this paper 
are presented as percentages. 

Although Ss were not so instructed, the on-off 
target limits for each of the four indicator needles 
were two units to the right and left of the zero 
point (deviation of any needle from zero by as 
much as, say, .25 units was clearly discrimi- 
nable). The instructions emphasized that the 
task was to center, and to keep centered as much 
as possible, all of the needles. It is unlikely 
that the Ss identified the scoring limits. 

Additional points of procedure were as 
follows: the number of trials per day was 8, 
rest between trials was 60 sec., number of Ss 
per group was 18, and number of days of 
practice was 20. 

On Days 2, 6, 10, 14, and 18, a multichannel 
operations recorder was used to record various 
aspects of momentary on-off target performance. 
The clocks and recorder were wired in parallel 
so that the readings of one instrument could be 
checked against the other. Records were made 
of 8, H, A, B, and SHAB on the last 7 trials, of 
AB and HAB on Trials 2, 3, 7, and 8, of SA and 
SB on Trial 4, of AH and BH on Trial 5, and 
of AB and BH on Trial 6. These records made 
possible an analysis of time of occurrence and 
duration of many on-off target events at selected 
points during the learning session. It was not 
convenient to obtain a record of all possible 
combinations at any given moment of practice. 


Procedure in Exp. 2 

Experiment 2 began on Day 21 of practice 
because proficient Ss were required. The 72 
Ss of Exp. 1 were reassembled into two matched 
groups and run an additional four days in a 
manner identical with that of the preceding 20 
days except as noted below. 


TABLE 2 
EXPERIMENTAL Desicn, Exp. 1 


| 
| Blocks of Days 


Group — 


LC (Lowest of 4 components) L¢ LC LC 
RC (Random one of 4 com | 
ponents) RC | R¢ LC 
NC (No component reported | 
lowest) NC LC L¢ 
NC (No component reported | | 
lowest) N¢ RC LC 
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Group C (control).—After cach trial, a ran- 
domly selected component was identified as 
lowest. The Ss were instructed that these 
scores represented their worst component. 
(Same as RC, Exp. 1.) 

Group SI (special instruction)—These Ss 
were given their lowest component after each 
trial and told, “From now on I want you to pay 
twice as much attention to your A needle. 
Watch the A needle twice as often as you do any 
other needle and be sure that you make about 
twice as many corrections on the A control as 
any othercontrol. I’ll tell you what your lowest 
score is after each trial.” 

Another important difference between Exp. 
1 and 2 is that the scoring band for A was reduced 
by one-half (to 1 marker right and left of zero) 
for both groups. It was, however, almost im- 
possible for any S to deduce this change in 
calibration. The Ss of Group SI, of course, had 
the advantage of being instructed to attend 
twice as often to A (needle and control) as to 
any other needle and control. A further ad- 
vantage was that whenever A ranked lowest, A 
was reported as lowest. Thus, two variables 
were purposely confounded in Group SI. Both 
variables should act in the same direction and 
should lead to better A performance for Group 
SI. The reason for using two variables for this 
group was that substantial differences between 
component performances from group to group 
were desired. There was no intention of par- 
celing out the separate effects of instructions (to 
pay twice as much attention to A) and identi- 
fying the lowest component. Experiment 2, 
then, represents extreme conditions calculated 
to disrupt the usual modes of responding to a 
four-component complex.’ 

For the last day of practice (Day 24) addi- 
tional change in the behavior of Group SI was 
sought by arbitrarily reporting B, after each 
trial, to be the lowest component of the four. 
The Ss were then faced with the problem of 
attending twice as often to A as to any other 


3’ Halving the scoring area does not alter S 
behavior, but does alter the magnitude of the A 
scores obtained. This procedure results in a 
situation where one component of a complex is 
more important or critical than the others, for 
it alters the weighting of A in the combination 
scores. 

Halving A should result in A’s ranking fourth 
provided no special instructions for attending to 
A have been administered. Actually, A had an 
average rank of 2.2 on Trial 1 of Day 21 for 
Group SI, demonstrating the immediate effect 
of the special instructions. On later trials, if 
any S’s attention to A tended to lapse, the LC 
procedure could be expected to augment A. 
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TABLE 3 
Mean Dirrerences Between TRIALS 2 
AND 3 In PERCENTAGE Points 
on Day 2* 





— —— a 4+ —— — ———— 
Co yone 
coun oxen | Sum Sum 
¥ I | = a — Cc 
|siniz | B | 
SE a a — 
Lc |+2.6|-2.2|42.1|441] +91 | —.9 
RC |+7.7|~1.5|+1.1 $1.2) +4.4 | +14 








*Sum C* represents the mean total gain for any of 
four components identified for S as the avorst com 
ponent. Sum C™~ represents the average gain for the 
three components not represented to S as the worst 
component. 


component, while acknowledging that their 
worst performance was with respect to B. 


RESULTS 


It is not possible to present all the 
data nor to present all the comparisons 
which might be of interest. For 
example, over 96,000 time scores were 
recorded. General tabular summaries 


of the data have been deposited with 
ADI.4 


Experiment 1 


Examination of the daily means of 
components and combinations over 
Days 1-20 led to the conclusion that 
there was little, if any, difference 
between groups. However, even 
casual observation of other aspects 
of the data showed promise of dif- 
ferences when, say, Trial + 1 is 
examined in the light of which com- 
ponent was reported as lowest on 
Trial n. A thorough trial-by-trial 
analysis, though of general interest, 
is omitted from the present report 
because of space limitations. 

A cursory examination of this topic 
may be made, however, through the 
entries of Table 3. Group LC shows 


4Tables from which the figures were con- 
structed have been deposited with the American 
Documentation Institute. Order Document 
No. 4608 remitting $1.25 for microfilm or $1.25 
for photocopies. 
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a mean increase of 9.1 percentage 
points for the component represented 
to S as being his worst. On the other 
hand, the average gain of the other 
three components is negative (— .9) 
and substantial positive gains could 
otherwise be expected. This suggests 
that the gain for the worst component 
is at the expense of the three remain- 
ing components since the gain for any 
component tends to be associated 
with an LC treatment. The other 
entries represent the average gain for 
each component regardless of the KR 
schedule. A similar calculation for 
Group RC is represented in the bot- 
tom row; the conclusion here is much 
the same as for Group LC, except that 
component § is high for some un- 
known reason. 

Each of the seven major perform- 
ance measures increased in magni- 
tude as a function of day of practice. 
The grand means for all Ss for Days 
1 and 20 in Table 4 are suggestive of 
the level of performance and amount 
of improvement taking place. Actu- 
ally, peak performance was reached 
between Days 17 and 20 when the 
seven performance measures are sepa- 
rately examined over all days. It 
can be rather safely concluded that 
20 days of practice was about enough 
to bring each measure to, or very close 
to, its maximum (or plateau). 

Analysis of component independ- 
ence.—Obtained and predicted means 
for time on target in AB, HAB, and 


TABLE 4 
Granp Mean Scores ror Days 1 anp 20 
FOR SELECTED Response MEASURES 









































Measure 
Day | 
)S|H|A | B | AB | HAB| SHAT 
1 |51.1/53 2| 53.5| 57.5 |33.6]17.2| 8.6 
20 | 73.7 | 74.1 [vas 78.8* 59.6*| veel 32.8* 
*n = 71; otherwise » = 72. 


TABLE 5 


OsTatnepD AND Prepictep MEANs on TriAt 2 
FoR SeLectep Days ror Att Groups 
CoMBINED AND FoR Groups 
C anp SI 





(pt Bem ae | | 
Day | ABo ABp | Ane HAB,| SHAB, SHAB, 


Experiment 1 











2 139.0135.1/ 22.1210] 9.7 | 114 
6  |50.2|46.1| 33.3 | 30.9] 19.9 | 200 
10 $4.9|51.3| 36.7 | 34.5] 23.6 | 23.3 
14 $5.2151.6| 37.81 35.5| 251 | 247 
18 59.9 | 56.3 | 43.2 | 40.9} 30.5 | 29.4 





Experiment 2 


22 (all) 43.9 | 41.5 29.0 | 28 


5 | 17.6 | 18.2 
22C 141.8 | 38.7 | 30.7 | 29.5 | 20.3 | 20.9 
5 15.0 | 15.6 


22 SI | 46.0| 44.4] 27.3 | 27. 


SHAB are presented in Table 5, where 
the analysis through Day 18 is of 
immediate concern. AB, consistently 
underestimates AB, by about 4 per- 
centage points for each of the five 
day samples, suggesting the relation- 
ship between A and B is probably 
(a) positive and small, and (bd) in- 
variant with practice. A comparison 
of HAB, and HAB, leads to the same 
suggestion except that the absolute 
discrepancy is smaller, i.e., circa 2 
percentage points. In the comparison 
of SHAB, with SHAB,, the discrep- 
ancy is even less, but there is a very 
slight suggestion that the predicted 
mean overestimates the one obtained 
early in practice, but underestimates 
it late in practice. This is the only 
evidence of a changing dependency 
as a function of practice. 

The analysis thus far has em- 
phasized A and B, where but one 
control is operated in two dimensions. 
Comparisons across controls for pairs 
AH, BH, AS, and BS were also made. 
The striking feature of all comparisons 
was that the predicted mean was 











220 


always very close to the one obtained. 
It appears that the greatest difference 
between the obtained and predicted 
scores was with respect to AB. The 
graphic recordings were used to pursue 
this lead further. 

The relationship of A and B.—The 
graphical records for Days 2 and 18 
were scored for the four possible 
events with respect to scoring in A 
and B. All 72 Ss were scored for 
Trial 2, the examination of the record 
being made at each of 60 Il-sec. 
intervals during the trial. The 72 
2 X 2 contingency tables were analy- 
zed by groups, the hypothesis of in- 
dependent events being used to calcu- 
late predicted frequencies. All four 
x?’s were significant beyond the .05 
level. Of the 72 tables, 60 showed 
obtained AB frequencies greater than 
those predicted, and 12 obtained fre- 
quencies were smaller than _ those 
predicted. The latter were fairly 
uniformly distributed over the groups. 

For Day 18, as compared with Day 
2, the x?’s were generally larger. The 
x? for each of the four groups was 
significant beyond the .01 level, but 
the mean discrepancy between pre- 
dicted and obtained AB values de- 
creased from 3.4 (Day 2) to 2.6 points 
(Day 18), indicating a greater than 
chance dependency, yet strongly ar- 
guing against an increasing strength 
of dependency as a function of prac- 
tice. Further evidence to support 
this contention was that 16 of the 72 
analyses showed a negative depend- 
ence between A and B (compared 
with 12 on Day 2). <A Pearson r was 
computed between the discrepancies 
(AB. — AB,) from a chance relation- 
ship for Days 2 and 18. This r was 
+ .05 and indicates that the indi- 
vidual differences as regards depar- 
tures from chance are not strong, to 
say the least. , 
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TABLE 6 


Experiment 2: Darty Means ror Eacu 
ComMPONENT OF PRacTICE FoR Groups 

















C AND SI* 
— == —————— 
Group Measure | nae 
| 21 | 22 23 | 24 
Cc]. A I RR 
|S 72.8 | 73.7) 71.2) 74.4 
| H | 72.6) 73.6| 73.4] 74.4 
| A | 48.0 45.3) 48.8] 48.4 
B | 79.5| 79.3! 80.7| 79.3 
AB | 41.3 42.0| 40.9 
HAB 29.9 | 30.3} 30.3 
| SHAB | 22.3] | 22.1] 23.8 
| 2 (S,H,A,B) [273 272 [274 |277 
> (A,B) 128 125  |130 }128 
SI | 
| § | 63.0| 62.2] 61.3| 614 
| H | 63.2| 60.9| 63.1] 60.2 
\ | 68.0} 62.8) ok 61.8 
|B 71.1] 70.8| 71.3} 78.6 
| AB | 49.8 49.2| 50.0 
| HAB | 30.9 | 30.2| 29.1 
| SHAB | 20.4 18.6| 18.9 
> (S,H.A,B) |265 |257 |262 |262 
| > (A,B) 1139 | 134 138 1140 
*n = 36. 


Experiment 2 

The mean percentages of time on 
target for Groups C and SI are given 
in Table 6. The effect of reducing 
the scoring area of A by one-half is 
immediately apparent in the scores 
of Group C when the level of A is 
compared with the levels of S, H, and 
B. From Day 20 to Day 21, A has 
fallen approximately 24 percentage 
points to rank a very poor fourth on 
Day 21. By contrast, the A score 
for Group SI ranks second and has 
fallen from Day 20 to 21 about 4 
percentage points. Group SI, how- 
ever, shows a loss in all components. 
The losses from Day 20 to Day 21 are 
shown below. 
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Group Ss H A B AB 
© —1 —-l1 —24 0 —19 
SI —-ll —1ll —-4 -8 —-10 


From Day 20 to Day 21, Group C 
has lost 19 points in AB and Group 
SI has lost 10 points. Apparently, 
instructing the Ss of Group SI to pay 
twice as much attention to A has 
resulted in about one-half the loss to 
be expected in AB when the scoring 
area of A is halved. Group SI, 
however, has sacrificed on three com- 
ponents for a relatively high A. The 
over-all consequence of this sacrifice 
is reflected in SHAB, which is less 
(numerically) for SI than for C, 


while HAB is just slightly larger for 
SI. To summarize, Group SI is 
better at A and AB, about equal in 
HAB and SHAB, and worse at S, H, 
and B. Apparently, verbal instruc- 
tion has made a difference in the 
distribution of scoring. Whether the 
fundamental relationships of simul- 
taneity or nonsimultaneity have been 
disturbed by the rearrangements of 
scoring levels must next be deter- 
mined. 

Analysis of independence.—This 
analysis can be found in the bottom 
two rows of Table 5. A comparison 
of those combinations involving A 
and B shows AB, > AB,, and very 
close agreement among the higher 
order combinations. Nonetheless, the 
discrepancy between AB’s runs about 
the figure expected or a little less and 
does not appear disturbed by the SI 
Treatment. As for the other possible 
pairs of components, AS, BS, etc., the 
hypothesis of independence seemed 
even more reasonable for within- and 
between-group comparisons. 

Analysis of Day 24.—On Day 24 the 
procedure remained unchanged except 
that Group SI always received B as 
the worst component. The conse- 
quences of this treatment can be 


tm 
_ 


partly assessed by comparing the 
results of Day 23 with Day 24 in 
[able 6. Component B for Group 
SI increases by 7 points, and A de- 
creases by 5. No such changes show 
in the data of the control (C). Group 
SI maintains a constant value of AB 
across days as does Group C, but AB 
is greater for Group SI. The treat- 
ment for Day 24, then, illustrates two 
points: (a) knowledge of the “worst 
component” (the same one repeat- 
edly) can increase a part score (B), 
and (b) this is accomplished at the 
sacrifice of some other component 
(A). Further, the consequence of 
events a and b was a constant level 
of scoring in AB from day to day. 

A finer analysis of the data within 
Day 24 was made by examining the 
trial-by-trial scores. During the 
course of the 8 trials, Group SI 
showed (a) a progressive decline in A 
towards the level of A for Group C, 
(b) an increase in B to the level of B 
for Group C by Trial 4, and (c) a 
consistent superiority -in AB over 
Group C. The trend of AB for Group 
SI was, of course, correlated with the 
changes in A and B. The B trend 
tended to augment AB, and the A 
trend tended to decrease the magni- 
tude of AB. Inasmuch as AB is more 


TABLE 7 
[EXPERIMENT 2: 
OBTAINED AND Prepictep MEANS on 


Day 24, Triats 1 AND 8, FoR 
Group C anp SI * 


Measure 
Grouy Trial 
| A B. AB | AB, 
l $1.2 | 81.3 | 44.1 | 41.9 
Cc 
| 8 | 52.4] 76.1 42.6 | 40.3 
. oe 68.7 | 74.4 | 53.0 | 51.1 
SI | 
L«® 57.9 | 80.9 | 47.4 | 46.5 
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sensitive to changes in A than in B 
(since A < B), the over-all trend of 
AB should be negative. The trend 
of AB was in line with expectation, 
but its complexity posed problems 
which defied statistical evaluation. 
An abbreviated set of trial data is 
presented in Table 7. 

Finally, a test of independence for 
both groups on Day 24 must be made 
in order to determine if change in 
mode of responding upsets the pattern 
of independence. Unfortunately, no 
graphical recordings were available, 
but a test on the basis of clock scores 
can be made. The analysis for Trials 
1 and 8 is summarized in Table 7 and 
shows the mean discrepancy to be 
about equal from group to group. 

Analysis of component sums, Days 
21 through 24.—Summations over 
certain component values of Table 6 
suggest that total scoring within 
groups over successive days is inde- 
pendent of the treatments in that the 
sums are nearly equal from day to 
day. For example, consider >> (S, 
H, A, B) presented in Table 6. The 
critical comparison is, of course, be- 
tween Days 23 and 24 for Group SI. 
A limitation of this comparison is the 
small amount of change in the com- 
ponent means. 

A between-group comparison shows 
Group C to have the larger sum as 
well as the larger SHAB,. The 
theoretical model states that SI will 
have the greater SHAB, provided the 
sums areequal. ‘The important point 
is that the Ss of Group SI were able 
to redistribute their component pro- 
ficiencies, but were not quite able to 
maintain the control sum of compo- 
nent proficiencies. The data of Table 
6 show that the sum of the losses in 
S, H, and B is greater than the com- 
pensating “gain” in A.°® 


5 Theoretically, of course, it is also possible 





for a gain in SHAB, to be evidenced even with a 
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The sums for A and B in Table 6 
also suggest a within-group constancy 
from day to day, but in this com- 
parison SI shows the larger values. 


Discussion 


The data have provided some leads 
for later and more thorough analyses. 
Some of the leads are discussed below. 

There was no detectable, differential 
effect on daily mean performance of 
telling Ss their actual worst components, 
identifying randomly selected compo- 
nents as worst performed, and not iden- 
tifying worst components at all. This 
is a conclusion at variance with the 
general opinion that KR should be pro- 
vided whenever and wherever possible 
in order to increase over-all proficiency. 
Rather, the data of Exp. 1 suggested that 
the KR procedures will alter the per- 
formance of particular parts and com- 
binations of certain parts from trial to 
trial through a selective cueing function. 

The procedures used in both Exp. 1 
and 2 were effective in changing trial-to- 
trial scores within a day’s practice. In 
Exp. 1, Treatments RC and LC suc- 
ceeded in increasing the “lowest com- 
ponent” from Trial » to Trial 2+ 1. 
When all trials were averaged regardless 
of the schedule of identifying the com- 
ponent as LC (or RC), the mean did not 
appear to differ from the NC treatment. 
Treatments LC and RC, then, suggest 
more of a performance than a learning 
effect. The second procedure (Exp. 2) 
consisted of telling the Ss to pay twice 
as much attention to A and to adjust 
the A control twice as often (unknown 
to all Ss was the fact that scoring in A 
was made more difficult because the 
width of the scoring area was cut by one- 
half). This procedure, too, was effective 
net loss in the sum. Criterion improvement 
with such a loss would vary not only with the 
amount of decrement in 2 (S, H, A, B), but 
also, sum constant, with the relative sacrifices 
of high and low components. In the present 
case, the relative sacrifices are unfavorable, with 
B, the best performed, sacrificed less than the 
poorer components § and H;; this is particularly 
true on Day 24 when all gain in A is at the ex 


pense of S and H. 
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in that the experimental group was nearly 
able to maintain a constant level of per- 
formance in A while the control group 
was not. The experimental group, how- 
ever, showed losses in the three other 
components, suggesting that constancy 
of proficiency across the response spec- 
trum will be difficult to obtain from KR 
variations. The net loss in AB for the 
special instruction group, however, was 
substantially less than for Group C. 
With B repeatedly reported as worst 
on Day 24, Group SI demonstrated 
substantial gains in B, with losses in A 
and lesser losses in H. The consequence 
of this multiple rearrangement of scoring 
profile was greater time on target in AB, 
about equal time in HAB, but a slightly 


lesser SHAB. The changes in the part 
scores for the SI group represent a 
demonstration that variations in KR 
procedures can lead to relatively marked 
changes in performance even after much 
training has been administered. It might 
have been assumed, for example, that 
after 23 days of practice, habits had 
been established such that the pattern of 
responding might be relatively unsus- 
ceptible to modification. 

Though combinations HAB and SHAB 
did not differ greatly from group to 
group during Days 20-24, there was some 
suggestion that SHAB was less for Group 
SI. It was previously suggested (1) that 
if Ss could exchange proficiency on a well- 
performed component for proficiency on 
a less well-performed component, it was 
theoretically possible to raise the com- 
bination score provided the sum of indi- 
dual probabilities at remaining on target 
remained constant or nearly so. Group 
SI, however, appeared unable to main- 
tain the required sum in four compo- 
nents, and thus its SHAB, compared 
unfavorably with that of Group C. 
Though instruction and KR procedures 
may succeed in transferring equal 
amounts of attention from one compo- 
nent to another, it is generally unlikely 
that the transfer of scoring proficiency 
follows a simple additive equation. 
Specifically, for example, if the attention 
required to score ten units of S is applied 





to scoring in A, it is unlikely that A will 
be increased by 10 units. This would 
be true whenever scoring in A is more 
dificult than S as evaluated by per- 
forming each component alone. In the 
present study, it should be recalled, the 
scoring band of A was one-half that of S. 

Thus the data of Exp. 2 have shown 
that in a four-part skills task, proficiency 
at one part may be sacrificed for addi- 
tional proficiency on some other part, 
though the gain may not be equal to the 
sum of the component losses. The 
effects of the rearrangements are not 
simple, for certain first-order combina- 
tions (AB) may be raised and the third- 
order combinations (SHAB) may be 
lowered. Even so, in situations of this 
sort, Treatment SI could be recom- 
mended over C provided AB was a more 
important criterion of proficiency than 
SHAB. 

Where the parts of a task are differ- 
entially weighted in their contribution 
to the magnitude of the combination 
score for reasons of size of scoring band 
(as in Exp. 2) or for other physical 
reasons, the effects of KR treatments 
may be expected to be quite complex. 
It is conceivable, however, that multi- 
dimensional task situations can be em- 
ployed where the entire amount of 
proficiency sacrificed on one component 
may be applied to another component. 
Certainly further research is required 
to determine the conditions under which 
equal exchanges can or cannot take place. 

The analysis of independence among 
components led to the same sort of 
finding previously reported (1), in that 
relatively marked departures from a 
chance relationship are difficult to dem- 
onstrate. The primary dimensions of 
response, A and B, usually led to AB, > 


AB, by three or four percentage points, 
a finding reported for the Pedestal Sight 
Manipulation Test, and suspected to be 
true of the Stevens Pursuitmeter. Ad- 
ditional data on a Link Tester to the 
same effect also exist. In all of these 
tests, the fundamental relationship be- 
tween times of occurrence of scoring in 
two dimensions appears to be undis- 


6 Unpublished manuscript by the author. 
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turbed by practicing and learning over 
a number of days. The results from the 
Multidimensional Pursuit Test show that 
the KR procedures also did not alter the 
relationship between components of re- 
sponding. 

The present approach to the analysis 
of the relations among skill components 
and criterion combinations is suggestive 
of another procedure, as yet uninvesti- 
gated. The second procedure involves 
testing for component proficiency as S 
practices each component singly. Pro- 
ficiency with two or more components 
practiced simultaneously might be de- 
scribable in terms of proficiencies at 


responding to the components practiced 
separately. 


SUMMARY 


This study was concerned with determining 
techniques for rearranging the rank orders and 
altering the magnitudes of part proficiencies 
within a four-part, compensatory-pursuit task. 
Such techniques serve two purposes: (a) as aids 
in specifying part-part and part-whole relations 
or composition rules, and (b) as potential 
methods for rapidly improving the criterion 
scores of skilled Ss. 

Two variables, instructions and knowledge 
of results, were used over a period of 24 days to 
alter S’s pattern of attending to the four aspects 
of the task. The results indicated that these 
variables do lead to redistributions of part pro- 
ficiencies and have very predictable effects upon 


criterion scores. For example, identifying the 


EDWARD A. 


BILODEAU 


worst component, say A, for S leads to immediate 
improvement in that part, and usually in a 
combination score such as AB. In the present 
data the improvement in one part was accom- 
panied by a loss or losses in other parts such that 
proficiency at doing four parts simultaneously 
tended to fall slightly. It was suggested that 
these and other processes can be studied to 
advantage by means of the multiplication of 
individual component proportions of time on 
target. 

The data failed to reveal day-by-day dif- 
ferences three knowledge of results 
procedures which identified the worst com- 
ponent, a component selected at random and 
represented to S as worst, and no component 
at all. Since these procedures gave differential 
results at the trial level and no differences for 
aggregates of trials, it was suggested that they 
had momentary cue value as opposed to learning 
effects. 


between 
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